USE OF TRANSLATIONALLY ALTERED RNA TO CONFER RESISTANCE TO MAIZE
DWARF MOSAIC VIRUS AND OTHER MONOCOTYLEDONOUS PLANT VIRUSES



The invention relates generally to the genetic engineering of monocotyledonous
plants to resist virus infection through the expression of inhibitory transcripts or proteins
derived from the inhibited virus. In another aspect, the invention relates to the elucidation
and characterization of the genomic structure and organization of a maize dwarf mosaic
virus.



Plant viruses are a major problem in agriculture and cause significant losses in crop 
yield each year. In the past, available approaches for combating plant viruses were primarily 
limited to the selection of plant lines which exhibited genetic resistance to virus infection 
and the application of chemicals designed to protect plants from the organisms responsible 
for introducing the virus to the plant (i.e. viral vectors). 

Recently, a number of approaches for combating plant viruses have been developed
which are based upon the transformation of susceptible plant species with chimeric genes
which express transcripts or proteins that inhibit viral infection. These approaches include
genetically engineering plants to express viral coat protein or coat protein transcripts, viral
replicases in unmodified or modified form, antisense genes or ribozymes targeting viral
genomic RNA or transcripts, and altered viral transcripts (for a review, see Fitchen, J.H. et
al., Ann. Rev. Microbiol. 47: 739-763 (1993)). To apply any of these approaches, knowledge
of the structure and organization of the genome of the target virus is necessary.

With respect to the expression of altered viral transcripts to confer viral resistance,
limited success has been reported in dicotyledonous plants through the expression of viral
coat protein transcripts which have been modified to render them incapable of translation.
Expression of such "untranslatable" viral transcripts in tobacco has been reported to inhibit
tobacco etch virus (Lindbo, J.A. et al., Mol. Plant-Microbe Int. 5(2): 144-153 (1992); Lindbo,
J.A. et al., Virology 189: 725-733 (1992); WO 93/17098 to Dougherty, W.G. et al. (Sept. 2,
1993); Lindbo, J.A. et al., The Plant Cell 5: 1749-1759 (1993)), tomato spotted wilt virus
(Pang, S. et al., Bio/Technology 11: 819-824 (1993); DeHaan et al., Bio/Technology 10:
1133-1137 (1992)) and potato virus Y (Van der Vlugt, R.A. et al., Plant Mol. Biol. 17:
431-439 (1991)).

The ability of such untranslatable RNAs to inhibit viral infection does not appear to be
universal, however. Failure of such altered viral transcripts to inhibit viral infection has
been reported for tobacco mosaic virus (Powell, P.A. et al., Virology 175: 124-130 (1990))
and zucchini yellow mosaic virus (Fang, G. et al., Mol. Plant-Microbe Int. 6(3): 358-367
(1993)), a potyvirus similar to tobacco etch virus. Additional unreported failures may also
exist, since such negative results are rarely published.

The most prevalent virus infecting maize in the United States and Europe is maize
dwarf mosaic virus (MDMV). This virus is classified as a member of a family of plant viruses
known as the potyviruses. The potyviruses are the largest group of plant viruses and are
characterized by a long, flexuous rod particle morphology and are non-persistently
transmitted by aphid vectors (see Hollings, M. and Brunt, A., pages 732-807 of "Handbook
of Plant Virus Infection and Comparative Diagnosis", ed. by E. Kurstak, pub. by
Elsevier/North Holland Biomedical Press, Amsterdam (1981)). The potyviruses have a
genome composed of a single strand positive sense messenger RNA molecule which is
transcribed and translated as one polyprotein that is subsequently cleaved into its
component parts. The family is composed of many taxonomic strains, with the two most
common being strains A and B. These strains are differentiated by the ability of MDMV-A to
infect johnsongrass, which is the overwintering host. MDMV-A is primarily localized to the
southeastern United States due to the occurrence of johnsongrass in this area. MDMV-B is
more widespread and can be found in the U.S. corn belt and throughout Europe (i.e. Spain,
France, and Italy). MDMV-B is the most economically important maize virus due to its
widespread occurrence.

Viral diseases of maize result in an estimated 5% annual yield reduction as well as
reduced overall plant health, which results in increased susceptibility to other pathogens.
Experimental trials using common maize inbreds and hybrids have shown yield reductions
from MDMV as great as 35% in inoculated plots. MDMV is a major crop pest in maize where
it causes mosaic symptoms and dwarfing of infected plants, ultimately reducing crop yields
(Knoke, J.K. et al., pages 235-281 of "Diseases of Cereals & Pulses", volume I, ed. by
Singh, U.S. et al., pub. by Prentice Hall, Englewood Cliffs, NJ (1992)). When found in
combination with maize chlorotic mottle virus (MCMV), a synergistic condition known as corn
lethal necrosis results, causing even more severe crop damage (see Uyemoto, J.K., pages
141-143 of "Proc. Int'l. Maize Virus Disease Colloq. & Workshop", ed. by Gordon, D.T. et
al., pub. by Ohio State Univ. and Ohio Agric. Res. Dev. Center, Wooster, OH (1983)).

The economic impact of yield losses due to MDMV has generated considerable
interest in developing strategies to combat this virus. To date, however, only limited success
has been achieved in reducing the adverse impact of this virus. Thus there remains a need
to identify additional effective means for protecting host plants from MDMV.

Both strains A and B of MDMV are transmitted in nature by aphids in a non-persistent 
manner, thus insect control is not a practical control method. The most effective method of 
control of these diseases is the use of resistant germplasm. In maize, sources of resistant 
germplasm exist to both strains of the MDMV, but the efficacy of the resistance is somewhat 
variable and identification of this material can be difficult. Studies have shown that 
resistance to MDMV is not the result of a single, dominant gene, but rather is multigenic
(2-5 genes). There has been an abundance of research on the development of alternative
strategies for conferring resistance in transgenic plants. Most of these strategies have
focused on the expression of viral genes (i.e. the viral coat protein) in plants as a means of
conferring resistance. The benefits of these strategies are that the resistance can be
developed to viruses in which effective natural resistance cannot be identified and the
resistance is more easily transferred to agronomically desirable plant lines. The majority of
this work has focused on coat protein mediated resistance which is based on the 
expression of the viral coat protein in the plant. Coat protein mediated resistance has been 
particularly effective for some viruses (e.g. tobacco mosaic virus) but inconsistent for other 
viruses (e.g. potyviruses) when tested in model systems such as tobacco and in 
economically important grain crops such as maize, wheat, and rice. 

More recently, another virus resistance strategy has been developed which conferred
an immune phenotype in plants transformed with segments of virus sequence. The
phenomenon has been termed RNA-mediated resistance and is thought to be similar to
sense suppression or co-suppression described in other plant systems. Specifically, plants
were transformed with a sequence encoding the virus coat protein which had been modified
to cause premature termination during translation. The expression of this untranslatable
viral coat protein sequence at high levels was hypothesized to activate an RNA degradation
cycle which eliminated the transgene mRNA in a sequence specific manner. The pathway
was then believed to be capable of also eliminating an infecting virus which contains
sequence highly homologous (>90%) to the transgene sequence. Since the original
description of RNA-mediated resistance (see Lindbo, J.A. et al., Mol. Plant-Microbe Int. 5(2):
144-153 (1992) and DeHaan et al., Bio/Technology 10: 1133-1137 (1992)), there have
been additional descriptions of this form of resistance. Furthermore, it has been shown that
prior work thought to be resistance due to expression of a viral protein is more likely to be
RNA-mediated resistance. However, this strategy has not been effective for all viruses (see
Powell, P.A. et al., Virology 175: 124-130 (1990) and Fang, G. et al., Mol. Plant-Microbe Int.
6(3): 358-367 (1993)). The examples of RNA-mediated resistance have been limited to
model dicot hosts such as tobacco and potato. It is not known if this resistance will be 
effective in monocots nor what factors will be necessary for induction of this resistance. 

The genomic structure and organization of MDMV has remained largely
uncharacterized except for the elucidation of viral coat protein coding sequences (see
Frenkel, M. J. et al., J. Gen. Virol. 72: 237-242 (1991); see also Murray, L.E. et al.,
Bio/Technology 11: 1559-1564 (1993)). As a result, it is currently not possible to apply many
of the more recent recombinant-DNA based approaches that have been used for combating
plant viruses to MDMV. These approaches require a more extensive understanding of the
structure and organization of the genome of the target virus than is currently available for
MDMV. 



In one aspect, the present invention provides a method for protecting a 
monocotyledonous plant from infection by a virus by producing in such a plant an RNA 
molecule whose sequence corresponds, at least in part, to an mRNA or the plus strand RNA
produced by the virus. The RNA molecule produced according to the method of the
invention is modified so that it cannot be translated completely as compared to the viral
RNA to which it corresponds. Included within this aspect of the invention are chimeric
genes designed to express such modified RNA molecules in monocotyledonous plants, as
well as monocotyledonous plants containing such chimeric genes stably integrated into their
genome. Such plants and their progeny are protected from infection by monocotyledonous
viruses that produce messenger or plus-sense RNA which shares sequence identity with the
modified RNA molecule encoded and expressed by the stably integrated chimeric gene.

Another aspect of the invention is based upon structural and organizational 
information that has been elucidated for the genome of strain B of Maize Dwarf Mosaic 
Virus (MDMV-B) upstream of the coat protein gene. Included in this aspect of the invention 
are chimeric genes designed to express coding sequences for MDMV-B proteins including 
the coat protein (nucleotides 7308-8291 of SEQ ID No. 1), the RNA dependent RNA
polymerase (RdRp) (nucleotides 5745-7307 of SEQ ID No. 1), proteinase (nucleotides
4452-5744 of SEQ ID No. 1), a 6K protein (nucleotides 4293-4451 of SEQ ID No. 1),
cylindrical inclusion protein (CIP) (nucleotides 2376-4292 of SEQ ID No. 1), P3 proteinase
(nucleotides 1134-2375 of SEQ ID No. 1), and a portion of the helper component-P2
proteinase (HC-Pro) (nucleotides 3-1133 of SEQ ID No. 1). Methods for protecting plants
from MDMV infection by transforming them with these chimeric genes are included within
this aspect of the invention along with the resulting transgenic plants and their progeny. 
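
The coordinates listed above partition SEQ ID No. 1 into consecutive coding regions. As a quick consistency check, the following minimal Python sketch tabulates them and verifies that adjacent regions abut and that each spans a whole number of codons. The dictionary keys and function names are illustrative assumptions; only the nucleotide positions come from the text.

```python
# Region coordinates (1-based, inclusive) on SEQ ID No. 1, as stated in the text.
# The labels used as dictionary keys are illustrative shorthand.
REGIONS = {
    "HC-Pro (partial)": (3, 1133),
    "P3": (1134, 2375),
    "CIP": (2376, 4292),
    "6K": (4293, 4451),
    "proteinase": (4452, 5744),
    "RdRp": (5745, 7307),
    "coat protein": (7308, 8291),
}


def check_layout(regions):
    """Summarise each region: length, codon count, frame and contiguity."""
    report = []
    prev_end = None
    # Walk the regions in 5'-to-3' order of their start coordinate.
    for name, (start, end) in sorted(regions.items(), key=lambda kv: kv[1][0]):
        length = end - start + 1
        report.append({
            "name": name,
            "start": start,
            "end": end,
            "nt": length,
            "codons": length // 3,
            "in_frame": length % 3 == 0,
            # A region is contiguous if it starts right after the previous one.
            "contiguous": prev_end is None or start == prev_end + 1,
        })
        prev_end = end
    return report


if __name__ == "__main__":
    for row in check_layout(REGIONS):
        print(row)
```

Under these coordinates every region is a multiple of three nucleotides long and begins immediately after the preceding one, which is what a single polyprotein reading frame would require.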

The MDMV-B coding sequences may also be modified according to the first aspect of 
the present invention so that the RNA derived therefrom cannot be properly translated. The
present invention includes chimeric genes designed to express such translationally altered
MDMV-B RNAs in plants. Methods for protecting plants from MDMV infection by
transforming them with these chimeric genes are included within this aspect of the invention
along with the resulting transgenic plants and their progeny. 

The following sequences according to the invention are disclosed in the sequence listing: 

SEQ ID No. 1: Sequence of the polycistronic messenger RNA of maize dwarf mosaic
virus, strain B.

SEQ ID No. 2: Sequence of the polyprotein encoded by the polycistronic messenger RNA
of maize dwarf mosaic virus, strain B.

SEQ ID No. 3: First internal control alcohol dehydrogenase PCR primer used in analysis
of T0 plants as described in Example 4.

SEQ ID No. 4: Second internal control alcohol dehydrogenase PCR primer used in
analysis of T0 plants as described in Example 4.

SEQ ID No. 5: First PCR primer for the synthetic PAT gene used in analysis of T0 plants
as described in Example 4.

SEQ ID No. 6: Second PCR primer for the synthetic PAT gene used in analysis of T0
plants as described in Example 4.

SEQ ID No. 7: First PCR primer for the NIa proteinase gene used in analysis of T0 plants
as described in Example 4.

SEQ ID No. 8: Second PCR primer for the NIa proteinase gene used in analysis of T0
plants as described in Example 4.

For purposes of describing the present invention, the term "translationally altered
RNA" is used to refer to a modified form of a naturally occurring messenger RNA sequence
which cannot be completely translated compared to the unmodified, naturally occurring
form. A translationally altered RNA may be incapable of being translated at all or it may be
capable of being partially translated into an attenuated peptide corresponding to a portion
of the peptide encoded by the naturally occurring messenger RNA sequence from which the 
translationally altered RNA is derived. 

The coding sequence for a naturally occurring viral RNA sequence may be modified to 
encode a translationally altered RNA, for example, by removing its ATG initiation codon or 
by utilizing a portion which does not include the initiation codon. Other means for 
translationally altering a naturally occurring viral RNA molecule include introducing one or 
more premature stop codons and/or interrupting the reading frame. 

The basis for the present invention is two-fold. The first basis for the present 
invention is the discovery that reduced susceptibility to infection by a virus may be conferred
upon a monocotyledonous plant by producing in the plant a translationally altered RNA
molecule corresponding in sequence to a plus-sense or messenger RNA molecule of the
target virus. The second basis for the present invention is the elucidation and
characterization by the inventors of the genomic structure and organization of strain B of
maize dwarf mosaic virus (MDMV-B). These two bases are addressed consecutively below
and are both represented by the examples demonstrating resistance to MDMV-B via 
expression of a translationally altered RNA in a transgenic maize plant. 

The first aspect of the present invention is directed to a general method for reducing
the susceptibility of a monocotyledonous plant to viral infection by producing in the plant a
translationally altered RNA molecule corresponding to a messenger RNA sequence of the
target virus. Viruses infecting monocotyledonous plants will be referred to as
monocotyledonous viruses. A method is provided for protecting progeny of a
monocotyledonous parent plant from viral infection by transforming said parent plant with a
chimeric gene comprising a monocotyledonous plant promoter operably linked to a
nucleotide sequence derived from the genomic sequence of a virus infecting
monocotyledonous plants, wherein said nucleotide sequence contains a modification
rendering a messenger RNA transcribed from said nucleotide sequence incapable of
complete translation, and obtaining progeny plants. Alternatively, said progeny of a parent
plant can be protected from viral infection by breeding the parent plant with a
monocotyledonous plant having an inheritable trait of resistance to infection due to its
expression of a chimeric gene comprising a monocotyledonous plant promoter operably
linked to a nucleotide sequence derived from the genomic sequence of a virus infecting
monocotyledonous plants, wherein said nucleotide sequence contains a modification
rendering a messenger RNA transcribed from said nucleotide sequence incapable of
complete translation.

The preferred approach for producing the translationally altered RNA molecule in a
monocotyledonous plant is by introducing a chimeric gene designed to express this
molecule into the genome of the plant. Such a chimeric gene will consist of a plant
promoter operably linked to a nucleotide sequence derived from the genomic sequence of a
virus infecting monocotyledonous plants, wherein said nucleotide sequence contains a
modification rendering a messenger RNA transcribed from said nucleotide sequence 
incapable of complete translation. 

The promoter component may be any monocotyledonous plant promoter, that is, any
promoter which is capable of regulating or directing the expression of an operably linked
gene in the targeted monocotyledonous plant. Such promoters are well known in the art.
Preferably, a promoter which is capable of directing strong constitutive expression is used.
Such promoters include, but are not limited to, the maize ubiquitin promoter described in
Toki et al., Plant Physiol. 100: 1503-1507 (1992), the maize phosphoenolpyruvate
carboxylase (PEPC) promoter as described in Hudspeth, R.L. et al., Plant Molec. Biol. 12:
579-589 (1989), and the CaMV 35S promoter as described in Kay et al., Science 236:
1299-1302 (1987).

The coding sequence component comprises a sequence which, when transcribed, 
produces a translationally altered RNA molecule corresponding to a target viral sequence. 
The target viral sequence is a messenger RNA (mRNA) molecule of the target virus, or a 
portion thereof. Since the target viral sequence is naturally translatable when a translation 
initiation codon is present, it is modified so as to render it translationally altered. For any
given target viral sequence, the skilled artisan will be able to determine various 
modifications which could be made to render the resulting RNA molecule translationally 
altered. 

Translation of an mRNA molecule in a plant cell generally requires the presence of an 
initiation AUG codon followed by an uninterrupted string of amino acid codons (known as 
the reading frame) ending with a translational stop codon, which may be either UAA, UAG
or UGA. A DNA molecule encoding a translatable mRNA molecule may be modified to
encode a translationally altered RNA, for instance, by either removing the initiation ATG
codon, interrupting the reading frame, adding premature stop codons, or by a combination
of these modifications. 
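
The three modifications can be illustrated on a toy sequence. The sketch below uses a deliberately simplified model, assumed here for illustration only, in which translation starts at the first ATG of the DNA coding strand and proceeds codon by codon until a stop codon is reached. The toy sequence and all function names are hypothetical and are not taken from MDMV-B.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # DNA equivalents of UAA, UAG, UGA


def first_orf(dna):
    """Model translation from the first ATG (a simplifying assumption).

    Returns (start_index, residues_translated, reached_stop), or None when
    the sequence contains no ATG at all. The initiating ATG counts as one
    residue.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    if start == -1:
        return None
    residues = 0
    for i in range(start, len(dna) - 2, 3):
        if dna[i:i + 3] in STOP_CODONS:
            return (start, residues, True)
        residues += 1
    return (start, residues, False)


def remove_start_codon(dna):
    """Delete the first ATG, if any."""
    i = dna.find("ATG")
    return dna if i == -1 else dna[:i] + dna[i + 3:]


def insert_premature_stop(dna, codon_index, stop="TAG"):
    """Insert a stop codon in frame, codon_index codons after the ATG."""
    pos = dna.find("ATG") + 3 * codon_index
    return dna[:pos] + stop + dna[pos:]


def delete_base(dna, index):
    """Delete one nucleotide, shifting the downstream reading frame."""
    return dna[:index] + dna[index + 1:]


# Hypothetical 96-nt coding sequence: ATG, 30 sense codons, then TAA.
TOY = "ATG" + "GCT" + "GTG" + "ACC" + "GCT" * 27 + "TAA"

if __name__ == "__main__":
    print("unmodified:      ", first_orf(TOY))
    print("ATG removed:     ", first_orf(remove_start_codon(TOY)))
    print("premature stop:  ", first_orf(insert_premature_stop(TOY, 5)))
    print("frame interrupted:", first_orf(delete_base(TOY, 3)))
```

In this toy case the unmodified sequence yields a 31-residue product, removing the ATG leaves nothing to initiate from, an in-frame stop after five codons truncates the product to five residues, and deleting a single base after the ATG exposes an out-of-frame TGA so that only two residues are translated.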

Introduction of one or more premature stop codons (encoded by DNA codons TAA, 
TAG or TGA) in a target viral sequence may be accomplished by adding or deleting 
nucleotides or by modifying existing nucleotides using standard techniques such as site 
directed mutagenesis or mutagenesis by PCR. Adding or deleting nucleotides may have
the additional benefit of interrupting the reading frame, which also has the effect of
translationally altering the RNA molecule. While the addition of a premature stop codon
anywhere along the length of the target viral sequence will render it translationally altered
as that term is used herein to describe the invention, it is preferable to introduce such stop
codons near the 5' end of the target viral mRNA so that any attenuated peptides which may 
be produced via partial translation are 20 amino acids or less in length. 

The reading frame of a target viral sequence may be interrupted by the addition or
deletion of nucleotides in the DNA coding sequence. As with the addition of premature stop
codons, it is preferable to interrupt the reading frame near the 5' end of the target viral RNA
so that any attenuated peptides corresponding to a portion of the peptide encoded by the
target viral RNA which may be produced via partial translation are 20 amino acids or less in 
length. 
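
The 20-residue preference translates directly into a nucleotide distance. As a small worked sketch, assuming 0-based positions and counting the initiating methionine as the first residue (both conventions are assumptions made here, as are the function names), the first in-frame stop codon must begin no more than 60 nucleotides downstream of the start codon:

```python
def attenuated_length(start_pos, stop_pos):
    """Residues translated when the start codon begins at start_pos and the
    first in-frame stop codon begins at stop_pos (both 0-based)."""
    offset = stop_pos - start_pos
    if offset <= 0 or offset % 3 != 0:
        raise ValueError("stop codon must be downstream of and in frame with the start codon")
    return offset // 3


def latest_stop_position(start_pos, max_residues=20):
    """Furthest position at which a stop codon may begin so that the
    attenuated peptide is at most max_residues long."""
    return start_pos + 3 * max_residues


if __name__ == "__main__":
    print(latest_stop_position(0))    # furthest acceptable stop position
    print(attenuated_length(0, 15))   # peptide length for a stop at nt 15
```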

Another way to translationally alter the target viral sequence is to remove the 
translation initiation codon, which will be an ATG. This may be accomplished simply by 
choosing a target viral sequence which does not include the translation initiation codon. 
Alternatively, this may be accomplished by disrupting the ATG codon either by adding, 
deleting or modifying nucleotides within this codon using standard techniques. 

Any messenger RNA molecule produced by the target monocotyledonous virus, or
any portion of such a molecule, may be used as the target viral sequence. The target viral
sequence is preferably at least 120 nucleotides in length, more preferably at least 250 
nucleotides in length, and most preferably at least 500 nucleotides in length. 

A translationally altered viral RNA according to the invention includes any modified 
form of a naturally occurring viral messenger RNA sequence which cannot be completely
translated as compared to the unmodified, naturally occurring form. Thus a translationally
altered viral RNA may either be incapable of being translated at all, or it may be capable of
translating an attenuated peptide corresponding to a portion of the peptide encoded by the
target viral sequence used as a template.

The inhibitory effect of a translationally altered viral RNA is contemplated to be based,
at least in part, upon its effect on host cell degradation mechanisms. Production of a
translationally altered viral RNA in a plant cell is contemplated to trigger one or more cellular
RNA degradation mechanisms which target the translationally altered viral RNA, as well as
any corresponding homologous unaltered viral RNA molecules which may be present in the
cell (see, e.g. page 550 of Dougherty, W.G. et al., Mol. Plant-Microbe Int. 7(5): 544-552
(1994); Chasan, R., The Plant Cell 6: 1329-1331 (1994)).

The ability to translate an attenuated peptide, particularly a short peptide less than 20 
amino acids, is contemplated to enhance the triggering effect of the translationally altered 
viral RNA upon host cell RNA degradation pathways contemplated to play a role in inhibition 
of viral infection. Thus translationally altered RNAs which are capable of translating an
attenuated peptide are preferred. More preferably, the translationally altered viral RNA is
capable of translating an attenuated peptide less than 20 amino acids in length. For target
viral RNAs which do not include a translation initiation codon, one may be added in
conjunction with the addition of a premature stop codon or interruption of the reading frame
to create a translationally altered RNA capable of translating an attenuated peptide (see, for
example, the construct pCIB5018 described in Example 4).

Target viral sequences may be selected from the group consisting of potyvirus,
luteovirus, tenuivirus, carmovirus, machlovirus, geminivirus and reovirus sequences and
may correspond to the coding sequence for any viral protein, such as a viral coat protein,
replicase, proteinase, inclusion body protein, helicase, 6K protein and VPg. Such
sequences are well known for several monocotyledonous viruses including, but not limited
to, MDMV (see SEQ ID NO. 1), sugarcane mosaic virus (partial sequence; see Frenkel, M.
J. et al., J. Gen. Virol. 72: 237-242 (1991)), johnsongrass mosaic virus (partial sequence)
(see Gough, K. H. et al., J. Gen. Virol. 68: 297-304 (1987)), maize chlorotic mottle virus (see
Nutter, R. C. et al., Nucleic Acids Research 17: 3163-3177 (1989)), maize chlorotic dwarf
virus (see WO 94/21796), maize rough dwarf virus (partial sequence) (see Marzachi, C. et
al., Virology 180: 518-526 (1991)), maize stripe virus (partial sequence) (see Huiet, L. et al.,
Virology 182: 47-53 (1991); Huiet, L. et al., J. Gen. Virol. 73: 1603-1607 (1992); Huiet, L. et
al., GenBank Accession Number L3446 (1993)), maize streak virus (see Mullineaux, P. M.
et al., EMBO J. 3: 3063-3068 (1984)), barley yellow dwarf virus (see Larkins, B. A. et al., J.
Gen. Virol. 72: 2347-2355 (1991)), and wheat spindle streak virus (partial sequence) (see
Sohn, A. et al., Arch. Virol. 135: 279-292 (1994)).

Suitable host plants which may benefit from the production of translationally altered
viral RNA such as altered MDMV RNA include any monocotyledonous species which are
susceptible to viral infection, particularly infection by a member of the potyvirus family. In
particular, suitable host plants include maize, wheat, sugarcane and sorghum.

In a preferred embodiment, the target viral sequence used is a coding sequence
which is identical or highly homologous among two or more monocotyledonous viruses or
virus strains. Expression of a translationally altered RNA in a monocotyledonous plant 
based on such a shared sequence is contemplated to inhibit infection by any of the viruses 
which produce a messenger RNA having homology with the target viral sequence. 
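
One simple way to screen a candidate shared sequence is to compute its percent identity against the corresponding region of each virus or strain. The sketch below is illustrative only: it assumes the sequences are already aligned and of equal length, borrows the >90% figure mentioned earlier in connection with RNA-mediated resistance as a default threshold, and uses hypothetical 20-nucleotide sequences and hypothetical strain names.

```python
def percent_identity(seq_a, seq_b):
    """Percent of matching positions between two pre-aligned sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def viruses_sharing_target(target, viral_regions, threshold=90.0):
    """Names of viruses whose region exceeds the identity threshold."""
    return [name for name, seq in viral_regions.items()
            if percent_identity(target, seq) > threshold]


# Hypothetical sequences for illustration; not real viral data.
TARGET = "ATGGCTGTGACCGCTGCTGC"
VIRAL_REGIONS = {
    "strain_1": "ATGGCTGTGACCGCTGCTGA",  # 1 mismatch in 20
    "strain_2": "TTGGCAGTGTCCGCTGCTGA",  # 4 mismatches in 20
}

if __name__ == "__main__":
    for name, seq in VIRAL_REGIONS.items():
        print(name, percent_identity(TARGET, seq))
    print(viruses_sharing_target(TARGET, VIRAL_REGIONS))
```

A real comparison would first align full-length sequences with a dedicated alignment tool; the position-by-position count here only shows the arithmetic.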

A second aspect of the present invention is based upon the elucidation and 
characterization by the inventors of the genomic structure and organization of strain B of 
maize dwarf mosaic virus (MDMV-B). Previously, only the genomic sequence of the MDMV-B
coat protein was known (see Frenkel, M. J. et al., J. Gen. Virol. 72: 237-242 (1991)). As a
result of the disclosed invention it is now possible to apply many of the more recent
recombinant-DNA based approaches that have been used for combating plant viruses to
MDMV such as the use of chimeric genes comprising a plant promoter operably linked to a
nucleotide sequence derived from the genomic sequence of maize dwarf mosaic virus strain
B encoding a viral protein other than a coat protein, wherein transgenic expression of said 
chimeric genes in a plant inhibits infection of said plant with said virus. 

The MDMV-B positive strand RNA genome is believed to be approximately 10,000 
bases in length based on the length of other potyviruses. The sequence of 8530 
nucleotides beginning at the 3' end of the MDMV-B genome is set forth in SEQ ID NO: 1. A
single long open reading frame was identified within this sequence of the viral genome and
the polyprotein amino acid sequence encoded by this open reading frame is provided in
SEQ ID NO: 2. With the sequence information provided, this viral genome can be isolated
and cloned using a variety of standard genetic engineering techniques well known to those
of skill in the art. Three DNA fragments covering 85% of the MDMV-B genome have been
cloned into a Bluescript II SK plasmid backbone (Stratagene), transformed and propagated
in the E. coli cell line HB101, and deposited on June 29, 1995 with the Midwest Area
National Center for Agricultural Utilization Research (formerly known as the Northern
Regional Research Lab and still referred to by the corresponding acronym "NRRL"). One of
the plasmids, designated "1-47" and deposited under the accession No. NRRL B-21479,
contains nucleotides 3252-8530 of the MDMV-B genome. Another plasmid, designated
"2-24" and deposited under the accession No. NRRL B-21480, contains nucleotides 1866-3317
of the MDMV-B genome. Yet another plasmid, designated "9-1-5" and deposited under the
accession No. NRRL B-21481, contains nucleotides 1-2122 of the MDMV-B genome.

The polyprotein encoded by the MDMV-B genome includes a single coat protein 
designated CP whose coding sequence extends from nucleotide 7308 to 8291 of SEQ ID 
No. 1 and whose amino acid sequence extends from amino acid 2436 to 2763 of SEQ ID 
No. 2. The MDMV-B polyprotein is also contemplated to include a replicase protein, three 
proteinases, a 6K protein, a helper component, proteins involved in viral movement in the 
host plant (both cell to cell and long distance transport), a helicase protein and a VPg 
protein. 
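
The nucleotide coordinates in SEQ ID No. 1 and the amino acid coordinates in SEQ ID No. 2 can be converted into one another. The following sketch assumes that the open reading frame begins at nucleotide 3 of SEQ ID No. 1; that starting point is inferred here from the stated HC-Pro coordinates (3-1133) and from the coat protein mapping (nucleotides 7308-8291, amino acids 2436-2763), and the function names are illustrative.

```python
FRAME_START = 3  # first nucleotide of codon 1 (inferred, see lead-in)


def aa_to_nt(aa_pos, frame_start=FRAME_START):
    """Return the (first, last) nucleotide of the codon for residue aa_pos.

    Both coordinate systems are 1-based.
    """
    first = frame_start + 3 * (aa_pos - 1)
    return (first, first + 2)


def nt_to_aa(nt_pos, frame_start=FRAME_START):
    """Return the residue number whose codon contains nucleotide nt_pos."""
    if nt_pos < frame_start:
        raise ValueError("position lies upstream of the reading frame")
    return (nt_pos - frame_start) // 3 + 1


if __name__ == "__main__":
    # Coat protein boundaries as given in the text.
    print(aa_to_nt(2436), aa_to_nt(2763))
    print(nt_to_aa(7308), nt_to_aa(8291))
```

With this offset the first coat protein codon (residue 2436) maps to nucleotides 7308-7310 and the last (residue 2763) to nucleotides 8289-8291, in agreement with the coordinates given for CP.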

MDMV-B is contemplated to contain a serine-like proteinase analogous to serine-like
proteinases that have been identified in related potyviruses. These serine-like proteinases
have a characteristic catalytic domain of three amino acids consisting of a histidine at
position 1 of the domain, an aspartic acid at the second position, and a cysteine at the third
(see Bazan, J. F. and Fletterick, R. J., Proc. Natl. Acad. Sci. USA 85: 7872-7876 (1988)).
These amino acids are separated in the primary amino acid sequence by a region spanning
approximately 140 amino acids. The intervening sequences between each of the catalytic
domain sequences exhibit additional limited homology among the known proteinases (see
Bazan, J. F. and Fletterick, R. J., Proc. Natl. Acad. Sci. USA 85: 7872-7876 (1988)).
Based upon comparison with the known proteinase sequences, the MDMV-B proteinase
catalytic domain is contemplated to span a 105 amino acid sequence from position 1718 to
1823 of SEQ ID No: 2 with the three catalytic residues occurring at amino acids 1718, 1753,
and 1823 of SEQ ID No. 2.

MDMV-B is also contemplated to contain a second proteinase analogous to the 
cysteine proteinases that have been identified in related potyviruses. The active-site 
residues form a catalytic diad made up of a conserved cysteine and histidine which are 
separated by 72 amino acids (see Oh. C. and Carrington, J. C. Virology 173:692-699, 
(1989)). This proteinase is located within the carboxy-terminus of the HC-Pro region of the 
potyvirus polyprotein. Based upon comparison with the known proteinase sequences of 
tobacco etch virus, the MDMV-B HC-Pro proteinase domain is contemplated to span a 74
amino acid region from position 263 to 336 of SEQ ID No: 2 with the two catalytic residues
occurring at amino acids 263 and 336.

The location of the MDMV-B putative helicase domain can be identified based on the
homology with other known viral helicase domains (see Gorbalenya, A. E. et al., Nucleic
Acids Research 17(12): 4713-4730 (1989)). The helicase domain consists of seven distinct
highly conserved segments which correspond to the NTP-binding motif. The primary
consensus site consists of a glycine at position 1 of the motif, glycine at position 3, lysine at
position 4, and either a serine or threonine at position 5 (see Gorbalenya, A. E. et al., supra).
The conserved helicase domain is located in the MDMV-B genome within a region encoding
the cylindrical inclusion protein (CIP) and is found from amino acids 880 to 1010 of SEQ ID
No: 2. The conserved domain (GxGDS) is located at amino acids 883, 885, 886, and 887
of SEQ ID No: 2.

The coding sequence for the replicase gene of MDMV-B may also be determined by 
the location of conserved motifs common to viral replicase genes and by identification of 
putative viral proteinase cleavage sites bordering the replicase coding sequence. 
Conserved motifs have been found in other viral replicases. In particular, the conserved 
amino acid motif GDD (known as domain C) is the hallmark consensus sequence for all 
RNA-dependent replicases (Poch et al., EMBO J. 8: 3867-3874 (1989)). This conserved motif
is found at amino acids 2266-2268 in the MDMV-B open reading frame (SEQ ID No: 2).
Two additional conserved motifs characteristic of a plant viral replicase have been identified
and designated as domain A and B (Poch et al., supra). Domain A is a 17 amino acid
sequence with two centrally conserved amino acids which are present in the MDMV-B
genome at amino acids 2163 and 2168 of SEQ ID No: 2. Domain B is a 10 amino acid
sequence consisting of 5 conserved amino acids which are present in the MDMV-B genome
at amino acids 2222, 2223, 2224, 2225 and 2226 of SEQ ID No: 2.

The isolated MDMV-B genomic sequences taught by the present invention are 
particularly useful for the development of viral resistance in susceptible host plants. With 
the information provided by the present invention, several approaches for inhibiting plant 
virus infection in susceptible plant hosts which involve expressing in such hosts various
inhibitory transcripts or proteins derived from the target virus genome may now be applied 
to MDMV. 

Use of translationally altered RNA in a method for producing a monocotyledonous
plant with an inheritable trait of resistance to infection by a maize dwarf mosaic virus
comprising transforming said plant with a chimeric gene comprising a monocotyledonous
plant promoter operably linked to a nucleotide sequence derived from the genomic
sequence of a maize dwarf mosaic virus, wherein said nucleotide sequence contains a
modification rendering a messenger RNA transcribed from said nucleotide sequence
incapable of complete translation, may now be applied to MDMV-B, as demonstrated by
Example 4.

Another approach which may be used to confer plant virus resistance is to express
the replicase gene of the target virus in the host plant (e.g. WO 94/18336 to Tumer et al. for potato
leaf roll virus and WO 91/13542 to Zaitlin et al. for tobacco mosaic virus; herein
incorporated by reference in their entirety). This approach may also be applied to MDMV-B 
using the information provided by the present invention. 

For resistance strategies which depend upon expression of a viral replicase coding
sequence in a transgenic plant, a cDNA clone encompassing nucleotides 5745 to 7307 of
SEQ ID No: 1, contemplated to include the active domains of the MDMV-B replicase, can be
used for plant transformation. More preferably, such strategies may be employed by
transforming a plant with larger expressible fragments of the MDMV-B genome contemplated to
encompass the entire replicase protein. In this case, the MDMV-B replicase would be
cleaved from the encoded polypeptide when exposed to MDMV-B viral proteinase in the
plant cell.
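
As a quick consistency check on the coordinates quoted above, a fragment given by 1-based, inclusive nucleotide positions has length end - start + 1 and should be a whole number of codons if it is in frame. The sketch below is illustrative; the helper names are hypothetical, and the only values taken from the text are the positions 5745 and 7307.

```python
def fragment_length(start: int, end: int) -> int:
    """Length of a fragment given 1-based, inclusive coordinates."""
    if start < 1 or end < start:
        raise ValueError("coordinates must satisfy 1 <= start <= end")
    return end - start + 1


def extract_fragment(sequence: str, start: int, end: int) -> str:
    """Slice a sequence using 1-based, inclusive coordinates."""
    fragment_length(start, end)  # validates the coordinate pair
    if end > len(sequence):
        raise ValueError("end lies beyond the sequence")
    return sequence[start - 1:end]


# Coordinates quoted in the text for the replicase-encoding cDNA clone.
replicase_nt = fragment_length(5745, 7307)
replicase_codons, remainder = divmod(replicase_nt, 3)
```

With these coordinates the fragment is 1563 nucleotides long, that is, 521 complete codons with no remainder.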

The MDMV-B replicase coding sequence may be engineered for recombinant 
expression in a monocotyledonous host plant which is normally susceptible to infection by
MDMV-B. Expression of MDMV-B replicase in such a monocotyledonous host plant is
contemplated to confer resistance to (i.e. inhibit) MDMV-B infection. 

Suitable host plants which may benefit from application of any of the resistance 
approaches described above include any monocotyledonous species which are susceptible
to infection by MDMV-B. In particular, suitable host plants are contemplated to include 
maize, sorghum and sugarcane. 

To express inhibitory transcripts or proteins derived from the MDMV-B genome in a 
host plant cell, the corresponding coding sequence is operably linked to regulatory 
sequences which cause its expression in the chosen host plant cell. Examples of 
promoters capable of functioning in plants or plant cells, i.e., those capable of driving 
expression of the associated coding sequences such as MDMV-B CP in plant cells, include
the cauliflower mosaic virus (CaMV) 19S or 35S promoters and CaMV double promoters;
nopaline synthase promoters; pathogenesis-related (PR) protein promoters; small subunit of
ribulose bisphosphate carboxylase (ssuRUBISCO) promoters; plant ubiquitin gene
promoters; plant actin gene promoters; plant pith-preferred promoters, and the like.
Preferred are the rice actin promoter (McElroy et al., Mol. Gen. Genet. 231: 150 (1991)),
maize ubiquitin promoter (EP-A-342 926; Taylor et al., Plant Cell Rep. 12: 491 (1993); Toki
et al., Plant Phys. 100: 1503-1507 (1992)), a maize pith-preferred promoter (WO 93/07278
incorporated by reference herein in its entirety; in particular see Figure 24 and pages 27-28),
and the PR-1 promoter from tobacco, Arabidopsis, or maize (see EP-A-332 104). Also
preferred are the 35S promoter and an enhanced or double 35S promoter such as that
described in Kay et al., Science 236: 1299-1302 (1987) and the double 35S promoter
cloned into pCGN2113, deposited as ATCC 40587. The promoters themselves may be
modified to manipulate promoter strength to increase expression of MDMV-B coding
sequences in accordance with art-recognized procedures.

The chimeric DNA construct(s) of the invention may contain multiple copies of a
promoter or multiple copies of a particular coding sequence. In addition, the construct(s)
may include coding sequences for markers and coding sequences for other peptides such
as signal or transit peptides, each in proper reading frame with the other functional
elements in the DNA molecule. The preparation of such constructs is within the ordinary
level of skill in the art.

Since the MDMV-B proteins are naturally expressed as part of a polyprotein, each
protein does not include its own translation initiation and translation stop codons. To express
such proteins individually in the context of a chimeric gene, a translation initiation codon will
need to be added immediately adjacent to the first codon if one does not occur as part of
the coding sequence. The skilled artisan will recognize that addition of such a translation
initiation codon will add a methionine amino acid to the amino-terminal end of the encoded
protein. Such an addition is not contemplated to have any significant effect upon the properties of the
protein. Also, a translation stop codon will need to be added to the chimeric gene 
immediately after the last codon of the protein if one does not naturally occur at this 
location. 
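
The flanking step described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function name is hypothetical, the input is assumed to be an in-frame DNA coding fragment, and TAA is chosen arbitrarily as the default stop codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def add_start_and_stop(coding_fragment: str, stop_codon: str = "TAA") -> str:
    """Flank an in-frame coding fragment with ATG and a stop codon if absent."""
    seq = coding_fragment.upper()
    if len(seq) % 3 != 0:
        raise ValueError("fragment must be a whole number of codons")
    if stop_codon not in STOP_CODONS:
        raise ValueError("unknown stop codon")
    if not seq.startswith("ATG"):
        seq = "ATG" + seq       # adds an amino-terminal methionine
    if seq[-3:] not in STOP_CODONS:
        seq = seq + stop_codon  # terminates translation after the last codon
    return seq


# Hypothetical three-codon fragment cut from the middle of a polyprotein.
flanked = add_start_and_stop("GGTAAAGCT")
```

A fragment that already begins with ATG and ends with a stop codon is returned unchanged, mirroring the "if one does not occur" conditions in the text.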

Useful markers include peptides providing herbicide, antibiotic or drug resistance,
such as, for example, resistance to hygromycin, kanamycin, G418, gentamycin, lincomycin,
methotrexate, glyphosate, phosphinothricin, or the like. These markers can be used to
select cells transformed with the chimeric DNA constructs of the invention from
untransformed cells. Other useful markers are peptidic enzymes which can be easily
detected by a visible reaction such as a color reaction, for example luciferase,
β-glucuronidase, or β-galactosidase.

Standard recombinant DNA and molecular cloning techniques used in the following
examples are well known in the art and are described by J. Sambrook, E. F. Fritsch and T.
Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold
Spring Harbor, NY (1989) and by T.J. Silhavy, M.L. Berman, and L.W. Enquist, Experiments
with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1984).

Example 1: Construction of Plant Transformation Vectors

Numerous transformation vectors are available for plant transformation, and the
genes of this invention can be used in conjunction with any such vectors. The selection of
vector for use will depend upon the preferred transformation technique and the target
species for transformation. For certain target species, different antibiotic or herbicide
selection markers may be preferred. Selection markers used routinely in transformation
include the nptII gene which confers resistance to kanamycin and related antibiotics
(Messing & Vieira, Gene 19: 259-268 (1982); Bevan et al., Nature 304: 184-187 (1983)), the
bar gene which confers resistance to the herbicide phosphinothricin (White et al., Nucl.
Acids Res. 18: 1062 (1990); Spencer et al., Theor. Appl. Genet. 79: 625-631 (1990)), the
hph gene which confers resistance to the antibiotic hygromycin (Blochinger & Diggelmann,
Mol. Cell. Biol. 4: 2929-2931 (1984)), and the dhfr gene, which confers resistance to
methotrexate (Bourouis et al., EMBO J. 2(7): 1099-1104 (1983)).

1. Construction of Vectors Suitable for Agrobacterium Transformation

Many vectors are available for transformation using Agrobacterium tumefaciens.
These typically carry at least one T-DNA border sequence and include vectors such as
pBIN19 (Bevan, Nucl. Acids Res. (1984)). Below, the construction of two typical vectors is
described.

1.1. Construction of pCIB200 and pCIB2001

The binary vectors pCIB200 and pCIB2001 are used for the construction of
recombinant vectors for use with Agrobacterium and were constructed in the following
manner. pTJS75kan was created by NarI digestion of pTJS75 (Schmidhauser & Helinski, J.
Bacteriol. 164: 446-455 (1985)) allowing excision of the tetracycline-resistance gene,
followed by insertion of an AccI fragment from pUC4K carrying an NPTII gene (Messing &
Vieira, Gene 19: 259-268 (1982); Bevan et al., Nature 304: 184-187 (1983); McBride et al.,
Plant Molecular Biology 14: 266-276 (1990)). XhoI linkers were ligated to the EcoRV
fragment of pCIB7 which contains the left and right T-DNA borders, a plant selectable
nos/nptII chimeric gene and the pUC polylinker (Rothstein et al., Gene 53: 153-161 (1987)),
and the XhoI-digested fragment was cloned into SalI-digested pTJS75kan to create
pCIB200 (see also EP-A-332 104, example 19). pCIB200 contains the following unique
polylinker restriction sites: EcoRI, SstI, KpnI, BglII, XbaI, and SalI. pCIB2001 is a derivative
of pCIB200 which was created by the insertion into the polylinker of additional restriction
sites. Unique restriction sites in the polylinker of pCIB2001 are EcoRI, SstI, KpnI, BglII,
XbaI, SalI, MluI, BclI, AvrII, ApaI, HpaI, and StuI. pCIB2001, in addition to containing these
unique restriction sites, also has plant and bacterial kanamycin selection, left and right T-DNA
borders for Agrobacterium-mediated transformation, the RK2-derived trfA function for
mobilization between E. coli and other hosts, and the OriT and OriV functions also from
RK2. The pCIB2001 polylinker is suitable for the cloning of plant expression cassettes
containing their own regulatory signals.

1.2. Construction of pCIB10 and Hygromycin Selection Derivatives thereof

The binary vector pCIB10 contains a gene encoding kanamycin resistance for
selection in plants, T-DNA right and left border sequences and incorporates sequences
from the wide host-range plasmid pRK252 allowing it to replicate in both E. coli and
Agrobacterium. Its construction is described by Rothstein et al., Gene 53: 153-161 (1987).
Various derivatives of pCIB10 have been constructed which incorporate the gene for
hygromycin B phosphotransferase described by Gritz et al., Gene 25: 179-188 (1983).
These derivatives enable selection of transgenic plant cells on hygromycin only (pCIB743),
or hygromycin and kanamycin (pCIB715, pCIB717).

2. Construction of Vectors Suitable for non-Agrobacterium Transformation

Transformation without the use of Agrobacterium tumefaciens circumvents the
requirement for T-DNA sequences in the chosen transformation vector and consequently
vectors lacking these sequences can be utilized in addition to vectors such as the ones
described above which contain T-DNA sequences. Transformation techniques which do not
rely on Agrobacterium include transformation via particle bombardment, protoplast uptake
(e.g. PEG and electroporation) and microinjection. The choice of vector depends largely on
the preferred selection for the species being transformed. Below, the construction of some
typical vectors is described.

2.1 Construction of pCIB3064

pCIB3064 is a pUC-derived vector suitable for direct gene transfer techniques in
combination with selection by the herbicide basta (or phosphinothricin). The plasmid
pCIB246 comprises the CaMV 35S promoter in operational fusion to the E. coli GUS gene
and the CaMV 35S transcriptional terminator and is described in WO 93/07278. The 35S
promoter of this vector contains two ATG sequences 5' of the start site. These sites were
mutated using standard PCR techniques in such a way as to remove the ATGs and
generate the restriction sites SspI and PvuII. The new restriction sites were 96 and 37 bp
away from the unique SalI site and 101 and 42 bp away from the actual start site. The
resultant derivative of pCIB246 was designated pCIB3025. The GUS gene was then
excised from pCIB3025 by digestion with SalI and SacI, the termini rendered blunt and
religated to generate plasmid pCIB3060. The plasmid pJIT82 was obtained from the John
Innes Centre, Norwich and a 400 bp SmaI fragment containing the bar gene from
Streptomyces viridochromogenes was excised and inserted into the HpaI site of pCIB3060
(Thompson et al., EMBO J. 6: 2519-2523 (1987)). This generated pCIB3064 which
comprises the bar gene under the control of the CaMV 35S promoter and terminator for
herbicide selection, a gene for ampicillin resistance (for selection in E. coli) and a polylinker
with the unique sites SphI, PstI, HindIII, and BamHI. This vector is suitable for the cloning
of plant expression cassettes containing their own regulatory signals.

2.2 Construction of pSOG19 and pSOG35

pSOG35 is a transformation vector which utilizes the E. coli gene dihydrofolate
reductase (DHFR) as a selectable marker conferring resistance to methotrexate. PCR was
used to amplify the 35S promoter (~300 bp), intron 6 from the maize Adh1 gene (~550 bp)
and 18 bp of the GUS untranslated leader sequence from pSOG10. A 250 bp fragment
encoding the E. coli dihydrofolate reductase type II gene was also amplified by PCR and
these two PCR fragments were assembled with a SacI-PstI fragment from pBI221
(Clontech) which comprised the pUC19 vector backbone and the nopaline synthase
terminator. Assembly of these fragments generated pSOG19 which contains the 35S
promoter in fusion with the intron 6 sequence, the GUS leader, the DHFR gene and the
nopaline synthase terminator. Replacement of the GUS leader in pSOG19 with the leader
sequence from Maize Chlorotic Mottle Virus (MCMV) generated the vector pSOG35.
pSOG19 and pSOG35 carry the pUC gene for ampicillin resistance and have HindIII, SphI,
PstI and EcoRI sites available for the cloning of foreign sequences.

Example 2: Construction of Plant Expression Cassettes

Gene sequences intended for expression in transgenic plants are first assembled in
expression cassettes behind a suitable promoter and upstream of a suitable transcription
terminator to create a chimeric gene. These expression cassettes can then be easily
transferred to the plant transformation vectors described above in Example 1.

Promoter Selection 

The selection of a promoter used in expression cassettes or chimeric genes will 
determine the spatial and temporal expression pattern of the transgene in the transgenic 
plant. Selected promoters will express transgenes in specific cell types (such as leaf
epidermal cells, mesophyll cells, root cortex cells) or in specific tissues or organs (roots,
leaves or flowers, for example) and this selection will reflect the desired location of
expression of the transgene. Alternatively, the selected promoter may be light-induced or
otherwise temporally regulated. A further alternative
is that the selected promoter be chemically regulated. This would provide the possibility of
inducing expression of the transgene only when desired, by treatment with a
chemical inducer.

Transcriptional Terminators 

A variety of transcriptional terminators are available for use in expression cassettes. 
These are responsible for the termination of transcription beyond the transgene and its
correct polyadenylation. Appropriate transcriptional terminators are those which are known
to function in plants and include the CaMV 35S terminator, the tml terminator, the nopaline
synthase terminator, and the pea rbcS E9 terminator. These can be used in both
monocotyledons and dicotyledons.

Sequences for the Enhancement or Regulation of Expression

Numerous sequences have been found to enhance gene expression from within the 
transcriptional unit and these sequences can be used in conjunction with the genes of this 
invention to increase their expression in transgenic plants. 

Various intron sequences have been shown to enhance expression, particularly in 
monocotyledonous cells. For example, the introns of the maize Adh1 gene have been
found to significantly enhance the expression of the wild-type gene under its cognate
promoter when introduced into maize cells. Intron 1 was found to be particularly effective
and enhanced expression in fusion constructs with the chloramphenicol acetyltransferase
gene (Callis et al., Genes Develop. 1: 1183-1200 (1987)). In the same experimental
system, the intron from the maize bronze 1 gene had a similar effect in enhancing
expression (Callis et al., supra). Intron sequences have been routinely incorporated into
plant transformation vectors, typically within the non-translated leader.

A number of non-translated leader sequences derived from viruses are also known to 
enhance expression. Specifically, leader sequences from Tobacco Mosaic Virus (TMV, the
"Ω-sequence"), Maize Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have
been shown to be effective in enhancing expression (e.g. Gallie et al., Nucl. Acids Res. 15:
8693-8711 (1987); Skuzeski et al., Plant Molec. Biol. 15: 65-79 (1990)).

Example 3: Transformation of Monocotyledons 

Transformation of monocotyledon species such as wheat or maize has become
routine. Preferred techniques include direct gene transfer into protoplasts using PEG or
electroporation techniques, and particle bombardment into callus tissue. Transformations
can be undertaken with a single DNA species or multiple DNA species (i.e. co-transformation)
and both these techniques are suitable for use with this invention.
Co-transformation may have the advantage of avoiding complex vector construction and of
generating transgenic plants with unlinked loci for the gene of interest and the selectable
marker, enabling the removal of the selectable marker in subsequent generations, should
this be regarded desirable. However, a disadvantage of the use of co-transformation is the
less than 100% frequency with which separate DNA species are integrated into the genome
(Schocher et al., Biotechnology 4: 1093-1096 (1986)).

EP-A-292 435 (to Ciba-Geigy), EP-A-392 225 (to Ciba-Geigy) and WO 93/07278 (to
Ciba-Geigy) describe techniques for the preparation of callus and protoplasts from an elite
inbred line of maize, transformation of protoplasts using PEG or electroporation, and the
regeneration of maize plants from transformed protoplasts. Gordon-Kamm et al., Plant Cell
2: 603-618 (1990) and Fromm et al., Biotechnology 8: 833-839 (1990) have published
techniques for transformation of an A188-derived maize line using particle bombardment.
Furthermore, WO 93/07278 (to Ciba-Geigy) and Koziel et al., Biotechnology 11: 194-200
(1993) describe techniques for the transformation of elite inbred lines of maize by particle
bombardment. This technique utilizes immature maize embryos of 1.5-2.5 mm length
excised from a maize ear 14-15 days after pollination and a PDS-1000He Biolistics device
for bombardment.

Transformation of rice can also be undertaken by direct gene transfer techniques
utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has been
described for Japonica-types and Indica-types (Zhang et al., Plant Cell Rep. 7: 379-384
(1988); Shimamoto et al., Nature 338: 274-277 (1989); Datta et al., Biotechnology 8: 736-740
(1990)). Both types are also routinely transformable using particle bombardment (Christou
et al., Biotechnology 9: 957-962 (1991)).

EP-A-332 581 (to Ciba-Geigy) describes techniques for the generation, transformation
and regeneration of Pooideae protoplasts. These techniques allow the transformation of
Dactylis and wheat. Furthermore, wheat transformation has been described by Vasil et al.,
Biotechnology 10: 667-674 (1992) using particle bombardment into cells of type C long-term
regenerable callus, and also by Vasil et al., Biotechnology 11: 1553-1558 (1993) and
Weeks et al., Plant Physiol. 102: 1077-1084 (1993) using particle bombardment of
immature embryos and immature embryo-derived callus. A preferred technique for wheat
transformation, however, involves the transformation of wheat by particle bombardment of
immature embryos and includes either a high sucrose or a high maltose step prior to gene
delivery. Prior to bombardment, any number of embryos (0.75-1 mm in length) are plated
onto MS medium with 3% sucrose (see Murashige & Skoog, Physiologia Plantarum 15: 473-497
(1962)) and 3 mg/l 2,4-D for induction of somatic embryos, which is allowed to proceed
in the dark. On the chosen day of bombardment, embryos are removed from the induction
medium and placed onto the osmoticum (i.e. induction medium with sucrose or maltose
added at the desired concentration, typically 15%). The embryos are allowed to plasmolyze
for 2-3 h and are then bombarded. Twenty embryos per target plate is typical, although not
critical. An appropriate gene-carrying plasmid (such as pCIB3064 or pSOG35) is precipitated
onto micrometer size gold particles using standard procedures. Each plate of embryos is
shot with the DuPont Biolistics helium device using a burst pressure of ~1000 psi and a
standard 80 mesh screen. After bombardment, the embryos are placed back into the dark
to recover for about 24 h (still on osmoticum). After 24 hours, the embryos are removed
from the osmoticum and placed back onto induction medium where they stay for about a
month before regeneration. Approximately one month later the embryo explants with
developing embryonic callus are transferred to regeneration medium (MS + 1 mg/liter NAA,
5 mg/liter GA), further containing the appropriate selection agent (10 mg/l basta in the case
of pCIB3064 and 2 mg/l methotrexate in the case of pSOG35). After approximately one
month, developed shoots are transferred to larger sterile containers known as "GA7s" which
contain half-strength MS, 2% sucrose, and the same concentration of selection agent.

Example 4: MDMV-B Resistance Conferred by Expression of Translationally Altered
Viral Transcripts

Our research has focused on cloning and sequencing the remainder of the MDMV-B 
genome. We have disclosed the majority of the MDMV-B sequence in this application. We 
have identified coding regions within the MDMV-B coding region based on conserved motifs 
previously identified in other potyviruses. The regions of the virus selected for use as 
transgenes have been the MDMV-B non-structural proteins (i.e. Replicase, Proteinase, and 
Helicase). These regions were targeted based on the expected higher degree of sequence
conservation within these genes among strains of MDMV. We predict that the use of these
regions will give the highest probability of obtaining resistance to multiple strains of MDMV
when transformed into elite maize inbreds. The sequences have been used to transform
maize plants for the purpose of conferring virus resistance.

Maize dwarf mosaic virus strain B (MDMV-B) was obtained from Dr. S. Jensen
(University of Nebraska-Lincoln) and maintained in a susceptible maize inbred by serial
inoculation. Virus was prepared for inoculation as previously described (see Law, M. D. et
al., Phytopathology 79: 757-761 (1988)).

The virus was purified from two week old infected maize tissue by the following
protocol. The harvested tissue was homogenized with 0.2 M sodium acetate, pH 5.0
containing 0.1% β-mercaptoethanol (1:5 ratio w:v) in a blender. The homogenate was
filtered through cheesecloth and then centrifuged for 15 minutes at 6000 RPM (Sorvall GSA
rotor). The recovered supernatant was then filtered through glass wool and adjusted to a
concentration of 0.5% Triton X-100 and 0.2 M NaCl. The virus was precipitated from the
solution by adding PEG 8000 (8% final concentration) and then stirring for 2 hours at 4°C.
The virus was recovered by centrifugation for 15 minutes at 8,000 RPM (Sorvall GSA rotor).
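
The additions in the precipitation step scale linearly with the volume of clarified supernatant, which can be tabulated with a short calculation. The sketch below rests on assumptions not stated in the text: the Triton X-100 percentage is taken as v/v, the PEG 8000 percentage as w/v, and the volume change caused by the additions is ignored. The function name is hypothetical.

```python
NACL_G_PER_MOL = 58.44  # molar mass of NaCl


def precipitation_additions(volume_ml: float) -> dict[str, float]:
    """Amounts to add to `volume_ml` of clarified supernatant.

    Assumes % means v/v for Triton X-100 and w/v for PEG 8000, and ignores
    the small volume change caused by the additions themselves.
    """
    if volume_ml <= 0:
        raise ValueError("volume must be positive")
    return {
        "triton_x100_ml": 0.005 * volume_ml,                  # 0.5% (v/v, assumed)
        "nacl_g": 0.2 * NACL_G_PER_MOL * volume_ml / 1000.0,  # 0.2 M
        "peg8000_g": 0.08 * volume_ml,                        # 8% (w/v, assumed)
    }


# Example for a hypothetical 500 ml batch of supernatant.
amounts = precipitation_additions(500.0)
```

Under these assumptions, 500 ml of supernatant would call for 2.5 ml Triton X-100, about 5.84 g NaCl and 40 g PEG 8000.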

The resulting pellet containing the virus was resuspended by stirring in 0.1 M Tris pH
6.5 containing 0.032 M sodium citrate. The virus solution was clarified by centrifugation
through a 20% sucrose pad for 2 hours at 28,000 RPM (SW28 rotor). The recovered pellet
was resuspended in 10 ml of 0.1 M Tris pH 6.5 containing 0.032 M sodium citrate. The
supernatant was adjusted to a concentration of 34% cesium sulfate and centrifuged for 14
hours at 48,000 RPM (Ti 70.1 rotor). The opalescent band containing the virus was
removed and dialyzed against 0.1 M Tris pH 6.5 containing 0.032 M sodium citrate. Viral
RNA was isolated from the purified virions by phenol extraction and ethanol precipitation.

The isolated RNA was then used as template for cDNA preparation using oligo dT
primers. The preparation of cDNA clones was performed by standard procedures as
described (see Sambrook, J. et al., "Molecular Cloning: A Laboratory Manual", Cold Spring
Harbor Laboratory Press (1989)).

Constructs were prepared to specific regions of the MDMV-B genome by PCR
amplification from cDNA clones. The region amplified by PCR was typically 1200 to 1400
nucleotides in length and was confirmed by sequencing. Constructs were prepared to the
regions of the MDMV-B genome which encode the viral replicase (NIb), proteinase (NIa),
and cylindrical inclusion protein (CIP). These regions were selected based on the higher
sequence conservation within these regions between members of the potyvirus family. The
constructs corresponding to a specific viral coding region were altered during PCR
amplification by nucleotide substitutions within the primers. A methionine translation
initiation codon was generated at the first codon preceding the first native codon and a
termination codon was created at the seventh codon in all constructs tested. This would
create an mRNA only capable of translating small peptides. The constructs were then ligated
into either the pUBA plasmid (see Toki et al., Plant Physiol. 100: 1503-1507 (1992)) or the
pCIB4421 plasmid. The pUBA plasmid contained the ubiquitin promoter and the NOS
terminator while pCIB4421 contained the maize phosphoenolpyruvate carboxylase (PEPC)
promoter and the 35S terminator. The plasmid constructs were then verified by DNA
sequencing.

The constructs used in this example to transform maize plants have been designated
pCIB5018 and pCIB5019. pCIB5018 was constructed by ligating the PCR amplified NIa
fragment (nucleotides 4452-5744 of SEQ ID No. 1) into pCIB4421. The NIa fragment used
for ligation had previously been altered by insertion of an ATG codon immediately before
the first nucleotide of the first codon (i.e. the G at position 4452 of SEQ ID No. 1) and
substitution of a thymidine (T) for the adenine (A) at nucleotide 4470 of SEQ ID No. 1 to
create a premature stop codon. pCIB5019 was constructed by ligating the altered NIa
fragment described above into the pUBA plasmid.
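
The positions quoted above can be checked arithmetically: nucleotide 4470 lies 18 bases after position 4452, which makes it the first base of the seventh native codon, consistent with the seventh-codon stop described earlier. The Python sketch below works through this and applies the same alteration to a toy fragment. The function names and the toy sequence are hypothetical (the sequence is not taken from SEQ ID No: 1), and the standard genetic code is assumed.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codon_of(position: int, first_codon_start: int) -> tuple[int, int]:
    """Return (1-based codon number, 1-based position within that codon)."""
    offset = position - first_codon_start
    if offset < 0:
        raise ValueError("position precedes the first codon")
    return offset // 3 + 1, offset % 3 + 1


def alter_fragment(fragment: str, sub_offset: int, new_base: str = "T") -> str:
    """Prepend ATG and substitute one base at a 0-based offset in the fragment."""
    seq = fragment.upper()
    altered = seq[:sub_offset] + new_base + seq[sub_offset + 1:]
    return "ATG" + altered


def translate(coding_dna: str) -> str:
    """Translate from the first codon until the first stop codon."""
    peptide = []
    for i in range(0, len(coding_dna) - 2, 3):
        residue = CODON_TABLE[coding_dna[i:i + 3]]
        if residue == "*":
            break
        peptide.append(residue)
    return "".join(peptide)


# Positions quoted in the text: fragment starts at 4452, substitution at 4470.
codon_number, base_in_codon = codon_of(4470, 4452)

# Hypothetical toy fragment whose seventh codon is AAG.
toy = "GGTGCTCCAGAACTTTCTAAGGGTGCT"
altered = alter_fragment(toy, 4470 - 4452)
peptide = translate(altered)
```

In the toy example the A-to-T substitution turns the seventh native codon AAG into the stop codon TAG, so the altered transcript encodes only the added methionine followed by six native residues.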

Microprojectile Bombardment Protocols

Plasmid DNA was precipitated onto 1 µm gold microcarrier particles as described in
the DuPont Biolistic manual: 5 µg of plasmid DNA containing a synthetic phosphinothricin
acetyltransferase selectable marker gene and 5 µg of either pCIB5018 or pCIB5019 were
added per 50 µl of prepared microcarrier. The synthetic phosphinothricin acetyltransferase
selectable marker gene provides resistance to the same selection agents as the BAR gene
(see Kramer, C. et al., Planta 190: 454-458 (1993)). Bombardment of tissue was carried out
with the DuPont PDS-1000He Biolistic device. An additional 150x150 mesh/linear inch
screen was inserted 2cm below the stopping screen. Immature embryos were bombarded
with 1550psi rupture discs on a plate angled 6-8cm below the stopping screen to maximize
scutellum exposure to particles. Type I callus was placed 4cm below the stopping screen
and 900psi rupture discs were used in bombardment. All plates for both explant types were
bombarded twice.

Immature Embryo Explant Source Initiation and Selection

Immature embryos of a proprietary Ciba elite line (CG00526) were used as the initial
explant source in microprojectile-mediated transformation. Embryos were excised from the
ears 10-14 days post-pollination, when 1-2mm in length. After surface sterilization in a 10%
Clorox solution, embryos were plated embryonic axis down on the surface of the agar-solidified
medium. Embryos were plated onto Duncan's "D" callus induction medium plus
5mg/l chloramben, 2% sucrose, 12mM proline and either the organic amendments specified
in Duncan's (2DG4) or a modified version (2DA1) which omits the casein hydrolysate and
adds the amino acids minus glutamine and asparagine from Kao and Michayluk's "KM"
medium (see Kao and Michayluk, Planta 126: 105-110 (1975)). The plated embryos were
kept in a 25°C dark culture room continuously until the regeneration phase was initiated.
The day after plating the embryos were transferred to the appropriate G4 or A1 media
containing 12% sucrose at least four hours prior to microprojectile bombardment. Thirty-six
embryos were arranged in a 2-3 cm circle in the center of the plate. The embryos remained
on the 12% sucrose plate overnight after bombardment. The following day, embryos were
transferred either to 2DG4 + 5 chloramben + the equivalent of a 10mg/l concentration of
Basta herbicide (glufosinate ammonium) or 2DA1 + 5 chloramben + 5mg/l Basta.

Fourteen days from the initial excision and plating, developing compact, organized 
type I callus was excised from the original explant and subcultured to either 2DG4 + 0.5mg/l
2,4-D + 10mg/l Basta or 2DA1 + 0.5mg/l 2,4-D + 5mg/l Basta. Viable, healthy callus was
serially subcultured every fourteen days during the selection phase. All tissue was then
transferred to Duncan's medium, modified by omitting all amino acids, plus 2% sucrose,
0.5mg/l 2,4-D and 10mg/l Basta (2DG8) at the end of eight weeks. After a two week
passage on the G8 medium, all living tissue was transferred to regeneration medium. 

Type I Explant Source Initiation and Selection

Immature embryos of the Ciba elite line (CG00526) were plated embryonic axis down
onto 2DG4 + 5 chloramben at the 1-2mm length size. The developing compact, highly
organogenic (type I) callus was excised from the original embryo explant after fourteen days
and maintained serially on 2DG4 + 0.5mg/l 2,4-D by subculturing to fresh medium every
ten to fourteen days. When the callus lines obtained were two to three months old, they were
prepared for microprojectile bombardment. The tissue was subcultured to fresh medium in
small pieces approximately 1-3mm in size one to two days prior to bombardment. On the
day of bombardment, the tissue was arranged in a 2-3cm circle in the center of a DA1 plate
containing 12% sucrose and 0.5mg/l 2,4-D at least four hours prior to bombardment. The
callus was kept on the plate after bombardment overnight, and transferred the next day to
2DA1 + 0.5mg/l 2,4-D + 10mg/l Basta. Viable, healthy callus was serially subcultured on the
same medium every fourteen days during the selection phase. All tissue was transferred to
Duncan's medium, modified by omitting all amino acids, plus 2% sucrose, 0.5mg/l 2,4-D and
10mg/l Basta (2DG8) at the end of eight weeks. After a two week passage on the G8
medium, all living tissue was transferred to regeneration medium.

Regeneration and Plantlet Establishment of Immature Embryo and Type I Explant Source 
Experiments 

Tissue for regeneration was moved to a 25°C light culture room under a 16 hour 
photoperiod. Regeneration medium consisted of Murashige and Skoog's (MS) salts and 
vitamins, 3% sucrose + 0.25mg/l ancymidol, 1.0mg/l NAA, 0.5mg/l kinetin and 5mg/l Basta. 
After a two week passage on the regeneration medium with growth regulators, the tissue 
was transferred to MS medium + 3mg/l Basta and no additional growth regulators. Plantlets 
reaching 1-3cm length were transferred from plates to Magenta GA-7 boxes containing MS 
medium (0.75X concentration + 1% sucrose) and no Basta for root development. Plantlets 
with sufficient root development were transplanted to soil and moved to the greenhouse. 
Plantlets were hardened off in a 70% humidity phytotron for one to two weeks before 
moving the plants to the greenhouse range. The greenhouse conditions were as follows: 
55% humidity, 400 Einsteins light intensity, 16 hour photoperiod, 80-84°F day temperature, 
64-68°F night temperature. Plants were allowed to grow to maturity in the greenhouse and 
were either selfed or backcrossed to the parental line in the T1 generation. 

Analysis of T0 Plants 

T0 plantlets were first assayed by polymerase chain reaction (PCR) to detect the 
selectable marker, the gene of interest and an alcohol dehydrogenase (Adh) gene 
sequence as an internal assay control. Plantlets were assayed at approximately eight to 
fourteen cm height, when the plantlets were still in the GA-7 boxes. Standard PCR 
conditions were used (see Kramer, C. et al., Planta 190: 454-458 (1993)). The Adh internal 
control primer pair sequence was TGCATGTCGGTTGTGTTGCA (SEQ ID No. 3) and 
CTCAGCAAGTACCTAGACCA (SEQ ID No. 4). The primer pair sequence for the synthetic 
PAT gene was TGTGTCCGGAGAGGAGACC (SEQ ID No. 5) and 
CCAACATCATGCGATCCACC (SEQ ID No. 6). The primer pair sequence for the NIa 
proteinase gene was GCGGGATCCATGGGGAAGAACAAACGCAGTTGA (5') (SEQ ID No. 7) 
and GCGGAGCTCTTACTCTTCAACGCTCGCGTC (3') (SEQ ID No. 8). The parameters for 
PCR amplification for all primer pairs were 45 sec at 94°C, 30 sec at 62°C, 30 sec at 72°C 
plus a 3 sec/cycle elongation extension for 40 cycles. 
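
As a consistency check on the NIa reverse primer, its 3' portion can be reverse-complemented and compared with the sense strand of the viral genome. The sketch below is illustrative only and is not part of the protocol: the helper name `reverse_complement` is hypothetical, and treating the last 18 bases of SEQ ID No. 8 as the gene-specific part (with the preceding bases as a restriction-site tail) is an assumption.

```python
# Illustrative sketch; not part of the original protocol.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Assumed gene-specific 3' portion: the last 18 bases of the reverse
# NIa primer (SEQ ID No. 8) as printed in the text.
gene_specific = "CTCTTCAACGCTCGCGTC"

# Sense-strand DNA that this portion anneals to, and its RNA equivalent.
sense_dna = reverse_complement(gene_specific)
sense_rna = sense_dna.replace("T", "U")

print(sense_dna)  # GACGCGAGCGTTGAAGAG
print(sense_rna)  # GACGCGAGCGUUGAAGAG
```

Read as codons, the RNA form is GAC GCG AGC GUU GAA GAG (Asp Ala Ser Val Glu Glu), which can be compared against the residues listed for the end of the NIa region in SEQ ID No. 1.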

Plantlets identified by PCR to be transformed were analyzed by Northern blot assay 
for mRNA transcript of the gene of interest (NIa proteinase). Plants were assayed for 
mRNA expression either while in the GA-7 containers or when the plants had been 
acclimated in the greenhouse. The probe was a 1303 bp fragment of the NIa gene excised 
by a BamHI/SacI restriction digest of the pCIB5019 plasmid. Labeling was carried out with 
the Gibco/BRL RadPrime DNA Labeling kit as described by the manufacturer. Northern blot 
protocols were performed as described (see Sambrook, J. et al., "Molecular Cloning: A 
Laboratory Manual", Cold Spring Harbor Laboratory Press (1989)). 

Analysis of T1 Plants 

T1 seed harvested from the T0 plants was first dried down in the drying room for one 
to two weeks before planting. Seed were planted directly in flats and watered in. The flats 
were bottom watered with either a 0.15% volume/volume Basta solution or with water two 
days after planting. Four different transformation events were tested for herbicide and 
disease resistance in this example, as well as the wild-type elite control. Forty seeds from 
each individual transformed plant were tested initially, 20 in Basta and 20 in the water 
control. Seven days after the first Basta drench, a second drench was carried out in the 
same manner. 

All plants were inoculated with MDMV-B following the second Basta soil drench when 
the plants were 4-5 inches in height (3-5 leaf stage). A second virus inoculation was 
performed on all plantlets 4-6 days after the first inoculation to ensure infection. Plants were 
scored for viability in the plus and minus Basta drench and for the presence or absence of 
viral symptoms at the end of two and a half weeks. 

Plants which showed resistance to the virus, as measured by the absence of viral 
symptoms, and a susceptible sibling were assayed by Northern blot analysis using the NIa 
fragment as described above. The resistant plants were also assayed by ELISA and 
Western blot analysis for the presence of MDMV-B coat protein in the plants. 

ELISA and Western Blot Analysis of the Transgenic Plants 

The primary antibody used for both assays was a polyclonal antibody specific for the 
MDMV-B coat protein which was obtained from Dr. S. Jensen (University of Nebraska-Lincoln). 
The second antibody was an affinity purified polyclonal IgG alkaline phosphatase 
labeled goat anti-rabbit antibody (Kirkegaard and Perry Laboratories, Gaithersburg, 
Maryland). 

ELISA Analysis 


Tissue samples were taken from all plants not exhibiting characteristic MDMV-B 
symptoms and from one infected plant. Samples were also taken from healthy and infected 
CG00526 plants as controls. The samples (two leaf punches, 1 cm in diameter) were taken 
from both the inoculated leaf and the youngest available leaf. The tissue samples were 
homogenized in 0.400 ml of borate buffered saline (100mM boric acid, 25mM sodium 
borate, 75mM sodium chloride). Aliquots (50µl) of each sample were applied to an 
ethanol-washed ELISA plate and incubated overnight at 4°C. The plates were then washed once 
with ELISA wash buffer (10mM Tris-HCl, 0.05% Tween-20, 0.02% sodium azide), and 
blocked with ELISA block/diluent (10mM sodium phosphate, 140mM sodium chloride, 
0.05% Tween-20, 1% BSA, 0.02% sodium azide) for one hour at room temperature. The 
plates were washed three times with ELISA wash buffer. The primary antibody was applied 
at a 1:5000 dilution in 50µl of ELISA block/diluent and incubated for 2 hours at 27°C and 
then washed three times with ELISA wash buffer. The second antibody was applied at a 
concentration of 1.5mg/ml in ELISA block/diluent and incubated for 2 hours at 37°C. The 
plates were washed three times with ELISA wash buffer and were developed by incubation 
in ELISA substrate (Kirkegaard and Perry) for 30 minutes at room temperature. The reaction 
was stopped by the addition of 50µl of 3M sodium hydroxide. The plates were read with a 
SLT 340 ATTC ELISA plate reader (SLT Labinstruments) at 405nm. 

Western Blot Analysis 

Western blot analysis was performed on samples used for ELISA analysis. A 2µl 
aliquot of the samples was diluted into 10µl of 1X loading dye (Novex Inc). The samples 
were electrophoresed on an 8%-16% Tris-glycine polyacrylamide gel (Novex) in Tris running 
buffer (25mM Tris-Base, 192mM glycine, 0.1% SDS) at 120 volts for approximately 2.5 
hours. The gel was blotted onto nitrocellulose using a Biorad blotting apparatus in transfer 
buffer (25mM Tris-Base, 192mM glycine, and 10% methanol) at 120 volts for 45 minutes. 
The filter was blocked with blocking/diluent (1X TBS (20mM Tris-Base, 500mM NaCl, pH 
7.5), 0.05% Tween-20, 1% BSA, 5% lamb serum) at room temperature for 45 minutes. The 
filter was incubated with the primary antibody, described above, at a dilution of 1:1000 in 
blocking/diluent at room temperature for 1.25 hours. The filter was washed for five minutes 
in 1X TTBS (1X TBS, 0.05% Tween-20). The second antibody, described above, was 
incubated with the filter in blocking/diluent at a dilution of 1:1000, for 1.25 hours at room 
temperature. The filter was washed twice for 5 minutes in 1X TTBS followed by a single 
wash in 1X TBS for 5 minutes. The filter was developed with Nitro Blue Tetrazolium (NBT) 
and 5-bromo-4-chloro-3-indolyl phosphate (BCIP) in 0.1M Tris-HCl pH 9.5 as described 
by the manufacturer. The filter was developed for approximately 20 minutes, and development 
was then stopped by washing the filter with water. 




Characterization of the MDMV-B Genome 

Clones have been isolated and sequenced representing 8530 nucleotides of the 
MDMV-B genome. We have identified a single large open reading frame, as would be 
expected of a virus belonging to the potyvirus family. We have identified regions of the 
polyprotein which would encode the coat protein (nucleotides 7308-8291 of SEQ ID No. 1 
and amino acids 2436-2763 of SEQ ID No. 2), the putative RNA dependent RNA 
polymerase (RdRp) termed NIb (nucleotides 5745-7307 of SEQ ID No. 1 and amino acids 
1915-2435 of SEQ ID No. 2), the NIa proteinase (nucleotides 4452-5744 of SEQ ID No. 1 
and amino acids 1484-1914 of SEQ ID No. 2), the 6K protein (nucleotides 4293-4451 of 
SEQ ID No. 1 and amino acids 1431-1483 of SEQ ID No. 2), the cylindrical inclusion protein 
(CIP) containing the helicase (nucleotides 2376-4292 of SEQ ID No. 1 and amino acids 
792-1430 of SEQ ID No. 2), the P3 proteinase (nucleotides 1134-2375 of SEQ ID No. 1 and amino 
acids 378-791 of SEQ ID No. 2), and a portion of the helper component-proteinase 
(HC-Pro) (nucleotides 3-1133 of SEQ ID No. 1 and amino acids 1-377 of SEQ ID No. 2). 
Identification was based on the location of putative cleavage sites and conserved motifs. 
The MDMV-B sequence of the CP region from our isolate was 99% identical to the 
previously sequenced MDMV-B CP and 78% identical to the MDMV-A CP. Further 
comparisons could not be made due to the lack of additional sequence for other MDMV 
strains. The sequence of MDMV-B was then compared to other potyviruses and was found 
to exhibit approximately 60% nucleotide sequence identity to other potyviruses. The level 
of identity varied little when sequences encoding the different proteins were used for the 
comparison. 
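
The nucleotide and amino acid coordinates quoted above are related by simple reading-frame arithmetic, since the open reading frame begins at nucleotide 3 of SEQ ID No. 1. The following is a minimal sketch of that mapping; the function name `nt_to_aa_range` and the short region labels are illustrative choices, not terms from the specification.

```python
# Illustrative sketch: relate nucleotide and amino acid coordinates.
CDS_START = 3  # first nucleotide of the open reading frame in SEQ ID No. 1


def nt_to_aa_range(nt_start: int, nt_end: int, cds_start: int = CDS_START):
    """Map a 1-based, in-frame nucleotide range to 1-based residue numbers."""
    if (nt_start - cds_start) % 3 != 0 or (nt_end - nt_start + 1) % 3 != 0:
        raise ValueError("range is not in frame with the CDS")
    first = (nt_start - cds_start) // 3 + 1
    last = (nt_end - cds_start + 1) // 3
    return first, last


# Nucleotide ranges as quoted in the text (labels are shorthand).
REGIONS = {
    "HC-Pro (partial)": (3, 1133),
    "P3": (1134, 2375),
    "CI": (2376, 4292),
    "6K": (4293, 4451),
    "NIa": (4452, 5744),
    "NIb": (5745, 7307),
    "CP": (7308, 8291),
}

for name, (start, end) in REGIONS.items():
    print(name, nt_to_aa_range(start, end))
```

Each computed residue range agrees with the amino acid range stated for that region, for example 1484-1914 for NIa and 2436-2763 for the coat protein.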

T0 Analysis 

Eighteen lines (individual transformation events from selection and regeneration) were 
obtained from the experiments in this example. 17 of the 18 lines were positive by PCR for 
the selectable marker, and 14 for the gene of interest. All 14 events which were PCR 
positive for the NIa gene were also positive for expression in the Northern analysis. The 
predominant mRNA species was approximately 1300 nucleotides in length, which would 
correspond to the predicted size of the transgene. A smaller species approximately 1000 
nucleotides in length was also detected, which most likely arose by processing. Differences 
in mRNA expression levels were seen between different events as well as between 
individual plants (siblings) from a given event. All PCR positive plants were used for seed 
production (T1). 

T1 Analysis 

Four plants from two different events were identified to be resistant to the virus inoculation 
as evidenced by the absence of visual symptoms. There was no correlation to Basta 
tolerance in this example. Northern analysis of the four plants showed no detectable NIa 
transcript in the four resistant plants, while an infected sibling plant from the same original 
ear (T0) was shown to have high levels of viral RNA. The level of MDMV-B in the infected 
sibling was similar to the levels seen in the control CG00526 plants. 

The resistant plants were also evaluated for the presence of viral coat protein by 
ELISA. The four values obtained for each sample, duplicate samples from the inoculated leaf and 
non-inoculated leaf, were averaged and a comparison made against the infected and 
healthy controls. No detectable virus was present in the resistant transformed plant lines by 
ELISA, for which the threshold of detection was approximately 2 ng of virus per sample. In 
contrast, the transformed siblings which exhibited symptoms contained levels of virus 
similar to those seen in the infected CG00526 control plants. These results provide conclusive 
evidence that the four plants were immune to MDMV-B infection (i.e. not supporting virus 
replication). The resistance was durable in that the resistant plants withstood two 
inoculations with high MDMV-B inoculum concentrations. The inoculum concentrations 
used in these experiments typically result in symptoms within four days in susceptible plant 
lines. Yet the resistant plants have produced neither visible symptoms nor detectable virus six 
weeks following inoculation. 
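
The averaging of the four ELISA values per sample and the comparison against controls can be sketched as follows. This is a hypothetical illustration only: the absorbance readings are made-up placeholders, not data from these experiments, and the cutoff rule (healthy control mean plus a fixed margin) is an assumption, since the text does not state how a sample was called negative.

```python
from statistics import mean


def mean_absorbance(inoculated, youngest):
    """Average duplicate A405 readings from the inoculated and youngest leaf."""
    readings = list(inoculated) + list(youngest)
    if len(readings) != 4:
        raise ValueError("expected duplicate readings from two leaves")
    return mean(readings)


def classify(sample_mean, healthy_mean, margin=0.1):
    """Hypothetical rule: 'detected' if above the healthy mean plus a margin."""
    return "detected" if sample_mean > healthy_mean + margin else "not detected"


# Made-up example values, NOT data from the experiments described here.
healthy = mean_absorbance([0.05, 0.06], [0.05, 0.04])
candidate = mean_absorbance([0.06, 0.05], [0.07, 0.06])
infected = mean_absorbance([1.20, 1.10], [1.35, 1.25])

print(classify(candidate, healthy))  # not detected
print(classify(infected, healthy))   # detected
```

In practice the cutoff would be set from the assay's own controls and its stated detection threshold rather than from an arbitrary margin.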



Various modifications of the invention described herein will become apparent to those 
skilled in the art. Such modifications are intended to fall within the scope of the appended 
claims. 






SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: CIBA-GEIGY AG 

(B) STREET: Klybeckstr. 141 

(C) CITY: Basel 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP): 4002 

(G) TELEPHONE: +41 61 69 11 11 

(H) TELEFAX: +41 61 696 79 76 

(I) TELEX: 962 991 



(ii) TITLE OF INVENTION: USE OF TRANSLATIONALLY ALTERED RNA TO CONFER 
RESISTANCE TO MAIZE DWARF MOSAIC VIRUS AND OTHER 
MONOCOTYLEDONOUS PLANT VIRUSES 



(iii) NUMBER OF SEQUENCES: 8 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30B 



(2) INFORMATION FOR SEQ ID NO:1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8543 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: RNA (genomic) 



(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..8291 

(D) OTHER INFORMATION: /product= "polyprotein encoded by 
MDMV-B genome" 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 8292..8530 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 3..1133 

(D) OTHER INFORMATION: /product= "3-prime sequence for 
HC-Pro" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 1134..2375 

(D) OTHER INFORMATION: /product= "P3 proteinase" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 2376..4292 

(D) OTHER INFORMATION: /product= "cylindrical inclusion 
protein" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 4293..4451 

(D) OTHER INFORMATION: /product= "6K2 (6kD protein)" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 4452..5744 

(D) OTHER INFORMATION: /product= "NIa proteinase" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 5745..7307 

(D) OTHER INFORMATION: /product= "NIb replicase" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 7308..8291 

(D) OTHER INFORMATION: /product= "coat protein" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

UC GAA GAG AAA CAA CGA GAG UAU CJU GCA AAG GAU CAA AAA CUA UCC 47 
Glu Glu Lys Gin Arg Glu Tyr Leu Ala Lys Asp Gin Lys Leu Ser 
* 5 10 15 



AGA AUG AUA CAA UUU AL"C AAA GAA AGG UGC AAU CCA AAA OJU UCG CAU 
Arg Mec lie Gin ?he lie Lys Glu Arg Cys ?isn Pro Lys Phe Ser His 
20 25 30 



95 
UUACCAACGCUAa3GCAAGlJCGCGGAAACAAUAGGGCa::URUACaG^ 143 
Leu Pro TJir Leu Trp Gin Val Ala Glu Thr He Gly His Tyr Ihr Asp 
35 40 45 

AACC»GU=Ai«CCAAAI]AACX3GAUGUai«3CGAAGCGCJCADCAAAGDa 131 
Asn Gin Ser Lys Gin He Mec Asp Val Ser Glu Ala Leu He Lys Val 
SO 55 60 

AAU ACJ aX5 ACJ ecu GAU GAL' GCJ ATO AAA GCA AGC GCA GOS UC3A COa 239 
Asn Hir Leu Hir Pro Asp Asp Ala Met Lys Ala Ser Ala Ala Leu Lau 
65 70 75 

GAAGroUCGCGAUSGUaUAAGAAUCGUAAGGAGUCACXAAAAOJGAC 287 
Glu Val Ser Arg lip IVr Lys Asn Arg Lys Glu Ser Leu Lys Thr Asp 
80 85 90 95 

UCA Uro GAA UOJ UUU AGA AAU AAA ADA UCA CCA AAG AGU ACA AUA AMJ 335 
Ser Leu Glu Ser Phe Arg Asn Lys He Ser Pro Lys Ser Thr He Asn 
100 105 HQ 

GCAGCTUUAAUGUSCGAUAAUCAA-JUGGMJAAAAAUGCAAAUUUUGUA 383 
Ala Ala Leu Mec Cys Asp Asn Gin Leu Asp Lys Asn Ala Asn Etoe Val 
115 120 125 

tXSG GGU AAU AGG GAA UAC CAC GCC AAA C3GA LTTJ UUC GCA AAC UMJ ODD 431 
Trp Gly Asn Arg Glu Tyr His Ala Lys Arg Phe Phe Ala Asn Tyr She 
130 135 140 

^^ftA GCA GDG GAD ox: ACA GRU OCA UAU GAA AAG CAC GUC ACA CGG OUC 479 
Xaa Ala Val Asp Pro Ihr Asp Ala Tyr Glu Lys His Val Thr Arg Ehe 
145 150 155 

AAC CCD AAD GGU CAA CGA AAG UUA UCA ADA GGA AAG UUA GuU AUC era 527 
Asn Pro Asn Gly Gin Arg Lys Leu Ser He Giv Lys Leu Val He Pro 
160 165 i-'o 175 

CJA GAC UDU CAA AAG ADD AGA GAA DCA UDC GOT GGA CJC UCG AUA AAU 57«: 
Leu Asp Phe Gin Lys He Arg Glu Ser Phe Val Gly Leu Ser He Asn 
180 185 190 

AGACAACCGCDGGHJAAAOSUUSDGDUAGCAAGAUCGAAGGAGGGUAU 
Arg Gin Pro Leu Asp Lys Cys . Cys Val Ser Lys He Glu Gly Gly TVr 
195 200 205 

AUA UAC CCA a3D DGC OSC GDC ACA ACA GAA UDU GGu AAA CC^ GCA UAC S7-' 
He Tyr Pro Cys O/s Cys Val Thr Thr Glu Phe Gly Lys Pro Ala Tyr 
210 215 220 

DCJ GAG AUA AUA CCD .rCA AAA GGG CAD AUA ACA AUA GGC AAU DCD 719 
Ser Glu He He Pro Pro Uir Lys Gly His He Thr He Gly Asn Ser 
225 230 235 

AUDGADDCAAAGAUDGDGGACUDGCCAAAUACAACACCACCCAGCAUG 767 
He Asp Ser Lys He Val Asp Leu Pro Asn Thr Thr Pro Pro Ser Mec 
240 245 25C 255 
UACiUKJGCUAAGGAUaSGUT^uGCUACAirAA^ 315 
Tyr He Ala Lys Asp Gly Tyr O/s Tyr lie Asn He Phe Leu Ala Ala 
260 265 270 

AUG AUC AAC GUU AAU GAA GAA UCU OCC AAG GMJ UAU ACG AAA UUU UUG 863 
Met He Asn Val Asn Glu Glu Ser Ala Lys Asp Tyr Hir Lys Phe h&i 
275 280 285 

AGO GAC GAA CUA GOU GAG CGU CTC GGA AAG UGG CCA AAG CUU AAA GAG 911 
Arg Asp Glu Leu Val Glu Arg Leu Gly Lys Itp Pro Lys Leu Lys Asp 
290 295 300 

GUA GCA ACA GCG IXSU UAU GCA UUA UCU GUA AUG UUU CCA GAA AUU AAG 959 
Val Ala Thr Ala Cys Tyr Ala Leu Ser Val Met Phe Pro Glu He Lys 
305 310 315 

AAU GCa GAG OJA CCD CCA AUU CUA GUU GAC GAU GAA AAU AAA UCA AUG 1007 
Asn Ala Glu Leu Pro Pro He Leu Val Asp His Glu Asn Lys Ser Met 
320 325 330 335 

GAC GUA AUC GAU UCA UAU GSU UCA CUA AGC GUU GGA UUU CAC ADA UUA 1055 
His Val He Asp Ser Tyr Gly Ser Leu Ser Val Gly Phe His He Leu 
340 345 350 

AAA GCA AGC ACG AUU GGU CAA L^UA AUC AAA UUU CAA UAU GAG UOJ AUG 1103 
Lys Ala Ser Thr He Gly Gin Leu He Lys Phe Gin Tyr Glu Ser Met 
355 360 365 

GAU AGU GAA AUG CGC GAA UAC AUA GUA GGA GGA ACJ CUC ACA CAA CAG 1151 
Asp Ser Glu Met Arg Glu Tyr He Val Gly Gly Thr Leu Thr Gin Gin 
370 375 380 

ACA UUC AAC ACA CUU CUU AAG AUG CUU ACG AAA AAC AUG UUC AAA CCA 1199 
Ihr Phe Asn Thr Leu Leu Lys Met Leu Thr Lys Asn Met Phe Lys Pro 
385 390 395 

GAG CGC AUC AAG CAG AUA ALTJ GAA GAG GAA CCU UUC UUA CUU AUG AUG 1247 
Glu Arg He Lys Gin He He Glu Glu Glu Pro Phe Leu Leu Met Met 
400 405 410 415 

GCG AUU GCG UCU CCA ACG GUA UUA AUA GCA OJA UAU AAU AAU UGU UAU 1295 
Ala He Ala Ser Pro Thr Val Leu He Ala Leu Tyr Asn Asn Cys Tyr 
420 425 430 

AUU GAG CAA GCU AUG ACA UAC UGG AUC GUU AAG AAU CAA GGA GUU GCA 1343 
He Glu Gin Ala Met Thr Tyr Trp He Val Lys Asn Gin Gly Val Ala 
435 440 445 

GCC AUA UUC GCA CAA CUC GAA GCA UUA GCC AAG AAA ACA UCC CAG GCU 1391 
Ala He Phe Ala Gin Leu Glu Ala Leu Ala Lys Lys Thr Ser Gin Ala 
450 455 460 

UUA GUU OIA CAA AUG CAG AUA CUU GAA AAA GCA UCU AAC CAA 1439 
Leu Val Leu Gin Met Gin .He Leu Glu Lys Ala Ser Asn Gin 



GAG CUA 
Glu Leu 
465 470 475 

UUA AGA UUA GCA GUJ UCA GGA Cw^ CAU Air GA^ 1487 
Leu Arg Leu Ala Val Ser Gly Leu Ser His lie Asp Pro Ala Lys Arg 
480 485 490 495 

CTUUUGUCXSUCACACCJaGAAGCGATOtXlA^ 1535 
Leu Leu Trp Ser His Leu Glu Ala Mfec Ser Ihr Arg Ser Glu Met Asa 
500 505 510 

AAG GAG UUA AUA GCTJ GAG GGG XJMJ GCA CUk tIAU GAC GAG CGC OJA UAC 1583 
Lys Glu Leu He Ala Glu Gly lyr Ala Leu Tyr Asp Glu Arg Leu Tyr 
515 520 525 

ACCCOGATCGAAAAAAGUUACGCCVGAUCaAUUAAACCAAU^ GCA 1631 
ISir Leu Mec Glu Lys Ser Tyr Val Asp Gin Leu Asn Gin Ser Tcp Ala 
530 535 540 

GAAUUGlXIIAlJACOTUGGAAAAUUUUCAGCAAUAt^ 1679 
Glu Leu Ser Tyr Cys Gly Lys Phe Ser Ala He 'Trp Arg Val Phe Arg 
545 550 555 

GUCAGGAAGUAUUACAAACCGLrUUUAACCGUGAGAAAAAGC^ 1727 
Val Arg Lys Tyr Tyr Lys Pro Ser Leu Thr Val Arg Lys Ser Val Asp 
560 565 570 575 

UUA GGC GCU GUA UAC AAU AlJffV UCA GOT ACG CAU CT^ 1775 
Leu Gly Ala Val Tyr Asn He Ser Ala Ifer His Leu He Ser Asp Leu 
580 585 590 



GCG CGG AAA AGU CAA GAU CAA GUC AGC 'JOT AUU OTA ACC .AAA OTC CGC 1823 
Ala Arg Lys Ser Gin Asp Gin Val Ser Ser He Leu Thr Lys Leu Arg 
595 600 605 

AAC GGU UUU UAU GAU AAA OTA GAGAAAGOTAGAAUACGAAOTAUAAAA 1871 
Asn Gly Phe Tyr Asp Lys Leu Glu Lys Val Arg He Arg Thr He Lys 
610 615 620 

ACG GOT UAU tX3G OTU Aim OOT GAU AIJA OTU AGA COT GUG C^ 1919 
Thr Val Tyr Trp Phe He Pro Asp He Phe Arg Leu Val His He Phe 
625 630 -635 

AUAGOTOTGAGUOTAOTAAOTACCAUCGOTAACAOTAOTAUAGCAACT 1967 
He Val Leu Ser Leu Leu Ihr Thr He Ala Asn Thr He He Val Thr 
640 645 650 655 

AOTAAUGACUACAAGAAAOTGAAGAAGCAACV^.AGAGAAGACGAAC^ 2015 
Met Asn Asp Tyr Lys Lys Leu Lys Lys Gin Gin Arg Glu Asp Glu Tyr 
660 665 670 

GAA GCA GAA AOT AGO GAA GOT CGC AGA AUC CAU UOT ACC OTA AUG GAA 2063 
Glu Ala Glu He Ser Glu Val Arg Arg He His -er Thr Leu Met Glu 
675 580 635 

GAG CGG AAG GAC AAU CUG ACG uGU GAA ' CAA OTU ALV GAG UAU AUG COT 2111 
Glu Arg Lys Asp Asn Leu Thr Cys Glu Gin ?he He Glu Tyr iMet Arg 
€90 695 700 

CMA AAU GAU CCA CGG CUA GUU GGA Q3A ACA CDS GAC UUG ACJ CAC ACA 2159 
Xaa Asn His Pro Arg Leu Val Gly Xaa T!hr Leu Asp Leu Tte His Tbr 
705 710 715 

GGUGCKrAUACAUGAAQGGAAAtXrAAUCTCGAAACCAAUUUGGAACM 2207 
Gly Val He His Glu Gly Lys Ser Asn Leu Glu Thr Asn Leu Glu Gin 
720 725 730 735 

UCAAUGGCAGUaGGAACCUTOAUAACAATOAUACUUGA^ 2255 
Ser Met Ala Val Gly Ihr' Leu lie ISir Met lie Leu Asd Pro Gin Lys 
740 745 750 

AGCGAUGCUGUCUaDAAGGOGUTOAACAAAAUGCGGACAG^ 2303 
Ser Asp Ala Val Tyr Lys Val Leu Asn Lys Met Arg rbx Val He Ser 
755 760 765 

ACAAUUGAACAA.:^GUCarVUUCCCUUCAGyGAAUUTC 2351 
Ihr He Glu Gin Asn Val Pro Phe Pro Ser Val Asn Phe Ser Asn ^le 
770 775 780 

UUAACACCaCCAGtXSGCACAACAGAGaGCAGAUGUUGAUGAGa^ 2399 
Leu Thr Pro Pro Val Ala Gin Gin Ser Val Asp Val Asd Glu Pro Leu 
785 790 795 

ACACTUAGCACUGAUAAAAAUUUAACAAIJCVGACUUUGACACAA^ 2447 
Ihr Leu Ser Ihr Asp Lys Asn Leu Thr He Asd Phe Asp Thr Asn Gin 
800 805 810 315 

GAU ^UUA CCUGCCGAUACAUCrAGUAAIJGAUGOTACAUUUGRAGAUlX^ 2495 
Asp Leu Pro Ala Asp Thr Phe Ser Asn Asp Val Thr Phe Xaa Asp Trp 
820 825 830 

OSG UCA ;>iMU CAA uTJA AGC AAC AAC AGA ACA GUG SCA CAC 2543 
Trp Ser Xaa GLa Leu Ser Asn Asn Arg Thr Val Xaa His Tyr Arg Xaa 
835 840 . 845 

aSGGGGGAAAGUYCAUUGGAAUUCACAOSAGAAAACGCAGCCGACACG 2591 
Trp Gly Glu Ser Xaa Leu Glu Phe Thr Arg Glu Asn Ala Ala His Thr 
850 855 860 

AGCAirGAAaXJGCACACUC\AACAUUGAGAGGGAAUU:UlX3ar 2639 
Ser He Glu Leu Ala His Ser Asn He Glu Arg Glu Phe Leu' Leu ^rg 
865 870 875 

GGAGCAGuTTGGCUCGQGAAAAUCCACUGGGUUACCAUACCAUaJU^ 2687 
Gly Ala Vai Gly Ser Gly Lys Ser Thr Gly Leu Pro Tyr His Leu Ser 
880 885 890 395 

AUG CGC GGA AAA GUG CU3 CUA OJA GAG CCU ACA AGA CCG CJA GCU GAG 2735 
Met Arg Gly Lys Val Leu Leu Leu Glu Pro Thr Arg Pro Leu Ala Glu 
900 905 910 
AAC GUG OSU i«3G CAA CIR CAA GGA CCX; a::A ITja AAC GUA AOJ CCA ^ 2783 
Asn Val Cys Arg Gin Leu Gin Gly Pro Pro Phe Asn Val Ser Pro Thr 
915 920 925 

OJUCAAADSCGUGGAUaAAGOUCCUUUGGAUGCAOJCCAAUCACAAIX: 2831 
Leu Gin Mec Arg Gly Leu Ser Ser Phe Gly Cys Thr Pro He Thr He 
930 935 

AUGACAUCJGGOUOCGCAUUGCACAUGUACGCAAAUAAUCCAGaUAAA 2879 
Met Thr Ser Gly Phe Ala Leu His Mec Tyr Ala Asn Asn Pro Asp Lys 
945 350 955 

ADA UCU GftG UAC GMJ UUC AIIA AIT UUU GAU GAA UGU CAU AUA AUG GAA 2927 
He Ser Glu Tyr Asp Phe lie He Phe Asp Glu Cys His He Met Glu 
960 965 970 975 

GCA CCA GC3 AIX3 GCC UUU UaU IXSJ OlA CX AAA GAA UAU GAA UMJ CGA 2975 
Ala Pro Ala iMec Ala Phe Tyr C/s Leu Leu Lvs Glu Tvr Glu Tyr Arg 
980 985 ' 990 

GGAAAAADUAUCAftGGUAUCAGCUACGCOJCCAGGA.^WSGGAGUGUGAA 3023 
Gly Lys He He Lys Val Ser Ala Ihr Pro Pro Gly .:^rg Glu Cys Glu 
995 1000 1005 

UUC ACA ACA CAA can CCA GUA GAC AUC CAU G-JU UGU GAG .VIU am ACU 307^ 
Phe Thr Thr Gin His Pro Val Asp He His Val Cys Glu Asn Leu Uir 
1010 1015 1020 

CAGCAACAGUUUGUUAIXSGAACUCGGGACJGGU'JCAACCGCAGAUGOJ 3119 
Gin Gin Gin Phe Val Met Glu Leu Gly Ihr Gly Ser Thr Ala Asp Ala 
1025 1030 1035 

ACG AAG -JAC GGA AAU AAU AUC QUA GUU UAU GUA GCA AGC 'uAU AAU GAC 3167 
Thr Lys TVr Gly Asn Asn He Leu Val Tyr Val Ala Ser Tyr Asn Asp 
^O'^O 1045 1050 1055 

GOC GAU UCA UOS UCG CAA GCA CUA GUC GAA CUU AAA UUU UCC GUA AUC 321'= 
Val Asp Ser Leu Ser Gin Ala Leu Val Glu Leu Lys Phe Ser Val He 
1060 1065 1070 

AAAGOSGAUGGCCGAACAAUSAAACAAAACACAACAGGAAUCAUUACrv 3263 
i-ys Val Asp Gly Arg Thr Mec Lys Gin Asn Thr Thr Gly He He Thr 
1075 1080 1085 

AAC GGu ACC GCA CAA AAG AAG UGU UL-U GUU GCX: GCA ACG .^AU AUA AUU 3311 
Asn Gly Thr Ala Gin Lys Lys Cys Phe Val Val Ala Thr Asn rig rig 
1090 1095 11^00 

GAG AAD GGC GUC ACA CJA GAU ALT! GAU GtJU GGU GUC GAG 'JUC GGA CUU 3359 
Glu Asn Gly Val Thr Leu Asp He Asp Val Gly Val Aso Phe Gly Leu 
1105 1110 1115 

AAA GUC UCA GCU GAC LIXS GAC GUU GAC AAC AGG GCG CJA UCG UAU AAA 3407 
Lys Val Ser Ala Asp Leu Asp Val Asp Asn Arg Ala Vai Leu Tyr Lys 
1120 1125 1130 1135 






CC3C GUA ACa AUA UCA UAU QGU GAA CUC AHA CAA CGA UTO OGU CGU GUU 3455 
Arg Val Ser lie Ser Tyr Gly Glu Leu lie Gin Arg h&i Gly Arg Val 
1140 1145 1150 

QGC AGA AAU AAA COJ GGU ACA GUU AUU CX3A AUC GGA AAA ACA AD3 AAA 3503 
Gly Arg Asn Lys Pro Gly Hir Val lie Arg lie Gly Lys Thr Met Lys 
1155 1160 1165 

GGa UU3 GAG GAA AUD CCA OCA ADG AUC GCA ACA GAG GCA GCC UUC AUG 3551 
Gly Leu Gin Glu He Pro Ala Met He Ala Ihr Glu Ala Ala Phe Met 
1170 1175 1180 

IXSa WC GOJ UAC GGU CUD AAA GUU AUC ACU CAU AAU GUU UCA AOG ACC 3599 
Cys Phe Ala TVr Gly Leu Lys Val He Tbr His Asn Val Ser T!hr Thr 
1185 1190 1195 

CAU OJU GCA AAG UGC ACA GUU AAA CAA GCG AGA ACC AUG AUG CAA UUU 3647 
His Leu Ala Lys Cys Thr Val Lys Gin Ala Arg Thr Met Met Gin Phe 
1200 1205 1210 1215 

GAA UUA UCA CCA UUU GUC AUG GOJ GAG CUC GUU AAG UUU GAU GGU UCA 3695 
Glu Leu Ser Pro *e Val Met Ala Glu Leu Val Lys Phe Asp Gly Ser 
1220 1225 1230 

AUG CAU CCA CAA AUA CAU GAG GCA CUA GUA AAA UAC AAA CUU AGA GAU 3743 
Met His Pro Gin He His Glu Ala Leu Val Lys Tyr Lys Leu Arg Asp 
1225 1240 1245 

UCU GUC AUA AUG CUC AGA CCG AAU GCA CUU CCA AGG GUC AAU UUA CAU 3791 
Ser Val He Met Leu Arg Pro Asn Ala Leu Pro Arg Vai Asn Leu His 
1250 1255 1260 

AAU U3G CJU ACA GCC CGA GAU UAU AAU AGA AUA GGA UGU UCA UUA GAA 3839 
Asn Trp Leu Thr Ala Arg Asp Tyr Asn Arg He Gly Cys Ser Leu Glu 
1265 1270 1275 

CUC GAA GAC CAC GUC AAA AUU CCG UAC UAC AUU AGG GGA GUU CCU GAC 3887 

Glu Asp His Val Lys He Pro Tyr Tyr He Arg Gly Val Pro Asp 
1280 1285 1290 1295 

AAG UUG uAU GGA AAG CUA UAU GAU AUU AUC UUA CAG GAU AGU CCA ACU 3935 
Lys Leu Tyr Gly Lys Leu Tyr Asp He He Leu Gin Asp Ser Pro Thr 
1300 1305 1310 

AGU UGC UAC AGU AGA CUA UCA AGU GCG UGU GCA GGU AAA GUA GCA UAU 3983 
Ser Cys Tyr Ser Arg Leu Ser Ser Ala Cys Ala Gly Lys Val Ala Tyr 
1315 1320 1325 

ACJ CUG GGA ACJ GAU CCA UUU UCA CUU CCA AGA AC\ AUA GCA AUA AUU 4031 
Thr Leu Arg Thr Asp Pro Phe Ser Leu Pro Arg Thr He Ala He He 
133G 1335 1340 

AAU GCC OVA AUC ACG GAG GAG UAU GCG AAG AGA GAU CAC UAU CGU AAC 4079 
Asn Ala Xaa He Thr Glu Glu Tyr Ala Lys Arg Asp His Tyr Arg Asn 
1345 1350 1355 

AU3i^^/C\AACCCr^lX:iJUCAUCACAC^ 4127 
Met He :caa Asn Pro Ser Ser Ser His Ala Phe Ser Leu Asn Gly Leu 
1360 1365 1370 1375 

GUG UCU AUG AUG GCU AOJ AGA tMJ AUG AAA GAC GAC ACA AAG GAG AAU 4175 
Val Ser Mer lie Ala Ibr Arg Tyr Met Lys Asp His Ihr Lys Glu Asa 
1380 1385 1390 

AUU GAC AAA CUC ADC AGA GUG CGU GAU CAA UUA OJU GAG UUU CAA GGQ 4223 
He Asp Lys Leu He Arg Val Arg Asp Gin Leu Leu Glu Phe Gin Gly 
1395 1400 1405 

AOJ GGA AUG CAA UUa CAA GAU CCAUCAGAAOJCAUGGAAAUUGGG GCU 4271 
Ibr Gly Met Gin Phe Gin Asp Pro Ser Glu Leu Met Glu He Gly Ala 
1410 1415 1420 

CLrAACACAGUUAUUCACCAAGGAADSGACGCAAUUGCAGCUa^ 4319 
Leu Asn Thr Val He His Gin Gly Miet Asp Ala He Ala Ala Cys He 
1425 1430 1435 

GAGUUAGAAGGACGAroGAAUGCUUCACUUAUACAACGCGAUCUCCUA 4367 
Glu Leu Gin Gly Arg Trp Asn Ala Ser Leu He Gin Arg Asp Leu Leu 
1440 1445 1450 1455 

AUUGCAGGUGGAGUUUUUAUCGGAGGCAUUUUGAUGAUGUGGAGCCUA 4415 
He Ala Gly Gly Val Phe He Gly Gly He Leu Met Met "Trp Ser Leu 
1460 1465 1470 

UUU AOJ AAA UGG AGU AAC ACA AAU GUC uCA CAU CAG GGG AAC AAA 4463 
Phe Thr Lys Itp Ser Asn Thr Asn Val Ser His Gin Gly Lys Asn Lys 
1475 1480 1485 

CGC AGU AGA CAA AAA CUUCGAUUCAAAGAAGCAAGAGACAACAAAUAU 4511 
Arg Ser Arg Gin Lys Leu Arg Phe Lys Glu Ala Arg Asp Asn Lys Tyr 
1490 1495 1500 

GCA UAU GAU GTC ACA GGA UCG GAA GAA UGC OXJ GGC GAG AAU UUU GGA 4559 
Ala TVr Asp Val Thr Gly Ser Glu Glu Cys Leu Gly Glu Asn Phe Gly 
1505 1510 1515 



ACA GCC UAU ACA AAG AAA GGU AAA GGA AAA GGA ACJ AAA GUU GGA CUC 4607 
Thr Ala Tyr Thr Lys Lys Gly Lys Gly Lys Gly Thr Lys Val Gly Leu 
1S20 1525 1530 1535 

GGU GCX3 AAG CAG CAU AAA ULr GAU AUG AUG OAC GGU UUC GAU CCC C^ 4655 
Gly Val Lys Gin His Lys Phe His Met Met Tyr Gly Phe Asp Pro Gin 
1540 1545 1550 

GAGUACAACCJAAUUCGGUUUGUCGAUCCACUCACGGGAGCAACJCJ^ 4703 
Glu Tyr Asn Leu He Arg Kie Val Asp Pro Leu Thr Gly Ala Thr Leu 
1555 1560 1565 

GAU GAA CAA AUC CAU GCC GAU AUA CGC UUA AUU CAA GAG CAC UUC GCU 4751 
Asp Glu Gin lie His Ala Asp He Arg Leu He Gin Glu His Phe Ala 
1570 1575 1580 

GAA AUU CGU GAG GAG GCA GUG AUU AAU GAC ACA AUU GAA AGG GAG CAG 4799 
Glu He Arg Glu Glu Ala Val He Asn Asp Thr He Glu Arg Gin Gin 
1585 1590 1595 

AUU UAC GGC AAU OCU GGA CUA GAA GCA UUU UUC AUA GAA AAU GGG UCA 4847 
He Tyr Gly Asn Rro Gly Leu Gin Ala Phe Phe He Gin Asn Gly Ser 
1600 1605 1610 1615 

GCA AAC GCU COG AGA GCJU GAU UUA ACA CCA CAU UCA COJ ACA CGA GOT 4895 
Ala Asn Ala Leu Arg Val Asp Leu Ttor Pro His Ser Pro Ihr Arg Val 
1520 1625 1630 

GOC ACA GGU AAU AAC AUA GCA GGG UUC CCA GAA. UAU GAA GGA ACA CUU 4943 
Val Thr Gly Asn Asn He Ala Gly Phe Pro Glu Tyr Glu Gly Thr Leu 
1635 1640 1645 

CGU CAG ACJ GGA ACA GCU AUA ACU AUA CCC AUU GGU CAA GUC CCA AUC 4991 
Arg Gin Ihr Gly Thr Ala He Thr He Pro He Gly Gin Val Pro He 
1650 1655 1660 

GCA AAU GAA GCA GGG GUU GCA CAC GAG UCA AAA UCC AUG AUG AAC GGG 5039 
Ala Asn Glu Ala Gly Val Ala His Glu Ser Lys Ser Met Met Asn Gly 
1665 1670 1675 

UUG GGU GAU UAC ACA CCA AUA UCG CAA CAA UUG UGU CUA GUA CAA AAU 5087 
Leu Gly Asp Tyr Thr Pro He Ser Gin Gin Leu Cys Leu Val Gin Asn 
1680 1685 1690 1695 

GAC UCG GAU GGG GUA AAG CGG AAU GUA UUU UCU AUU GGA UAU GGC UCA 5135 
Asp Ser Asp Gly Val Lys Arg Asn Val Phe Ser He Gly Tyr Gly Ser 
1700 1705 1710 

UAU CJU AUU UCA CCA GCG CAC UUA UUC AAA UAC AAC AAU GGU GAA AUA 5183 
Tyr Leu He Ser Pro Ala His Leu Phe Lys Tyr Asn Asn Gly Glu He 
1715 1720 1725 

ACA AUU AGA UCA UCA ,AGA GGA UUG UAC AAA ALTJ CGU AAU UCU GUG GAU 5231 
Thr* He Arg Ser Ser ^rg Gly Leu Tyr Lys He Arg Asn Ser Veil Asp 
1730 1735 1740 

UUA AAA UUA CAU CCA AUU GCA CAC AGA GAC AUG GUC AUA AUU CAA CUC 5279 
Leu Lys Leu His Pro He Ala His Arg Asp Met Val He He Gin Leu 
1745 1750 1755 

CCA AAG GAU UUC CCA CCG UUC CCA AUG CGC UUG AAA UUC GAA CAA CCA 5327 
Pro Lys Asp Phe Pro Pro Phe Pro Met Arg Leu Lys Phe Glu Gin Pro 
1760 1765 1770 1775 

UCA CGA GAU AUG CGA GUC UGC CTA GUA GGA GUC %AC UUC CAA CAG AAU 5375 
Ser Arg Asp Met Arg Val Cys Leu Val Gly Vai . -^n Phe Gin Gin Asn 
1780 1785 1790 
UAU AGC AOJ JJGC AUG GOA UCA GAA ACT AGU GUG ACA GCA CCA AAA GGA 5423 
Tyr Ser Thr Cys lie Val Ser Glu Ser Ser Val Thr Ala Pro Lys Gly 
1795 1800 1805 

AAU OGA GAC UUU TOG AAA CAU U3G AtlA UCA ACA GUC GAC GGU CAA OGU 5471 
Asn Gly Asp Phe Ttp Lys His Trp lie Ser TSir Val Asp Gly Gin Cys 
1810 1815 1820 

GGA CJA CCA UUG GUA GAU ACJ AAG AGC AAA CAU AUU GUC GGA AUU CAU 5519 
Gly Leu Pro Leu Val Asp Thr Lys Ser Lys His He Val Gly He His 
1825 1830 1835 

AGU CUU GCA t]CA ACA AOJ.GGA AAC AOJ AAU UUC UUU GUC GCU GUG CCa 5567 
Ser Leu Ala Ser Thr Ser Gly Asn Thr Asn Phe Phe Val Ala Val Pro 
1840 1845 1850 1855 

GAGAACUUUAAUGAAUACAirAAUQGACUCGUGCAAGCAAAUAAAUQG 5615 
Glu Asn Phe Asn Glu Tyr He Asn Gly Leu Val Gin Ala Asn Lys Trp 
I860 1865 1870 

GAA AAA GGA UGG CAC UAU AAU CCG AAU CUC AUA UCC UGG UGU GGA CUA 5663 
Glu Lys Gly Trp His Tyr Asn Pro Asn Leu lie Ser Trp Cys Gly Leu 
1875 1880 1885 

AAU UUA 'GUU GAU UCA GCC CCA AAA GGU UUG UUU AAA ACG UCA AAA UUG 5711 
Asn Leu Val Asp Ser Ala Pro Lys Gly Leu Phe Lys Thr Ser Lys Leu 
1890 1895 1900 

GUA GAA GAC UUG GAC GCG AGC GUU GAA GAG CAA UGC AAG AUC ACC GAA 5759 
Val Glu Asp Leu Asp Ala Ser Val Glu Glu Gin Cys Lys lie Thr Glu 
1905 1910 1915 

ACA UGG CJC ACA GAG CAA UUA CAA GAU AAU UUA CAA GUG GUU GCG AAA 5807 
Thr Trp Leu Thr Glu Gin Leu Gin Asp Asn Leu Gin Val Val Ala Lys 
1920 1925 1930 1935 

UOJ CQA GGC CAA CUA GUU ACC AAG CAU GUU GUU AAG GGU CAA UGC CCA 5855 
Cys Pro Gly Gin Leu Val Thr Lys His Val Val Lys Gly Gin Cys Pro 
1940 1945 ' 1950 

CAC UUU CAA UUG UAC UUA UCA ACA CAU GAC GAU GCU AAA 'GAA UAC UUC 5903 
His Phe Gin Leu Tyr Leu Ser Thr His Asp Asp Ala Lys Glu Tyr Phe 
1955 I960 . 1965 

GCA CCC ALX; CUU GGA AAA UAC GAC AAG AGU AGG OJU AAC AGA GCA GCU 5951 
Ala Pro Mec Leu Gly Lys Tyr Asp Lys Ser Arg Leu Asn Arg Ala Ala 
1970 1975 1980 

UUU AUC AAA GAC AUA UCA AAA UAU GCA AAA CCA AUU UAC AUU GGA GAA 5999 
Phe He Lys Asp He Ser Lys Tyr Ala Lys Pro He Tyr He Gly Glu 
1985 1990 1995 

AUC GAG UAU GAU AUC UUU GAU AGA GCU GUA CAG CGG GUU GUC AAU AUU 6047 
He Glu Tyr Asp He Phe Asp Arg Ala Val Gin Arg Val Val Asn He 
2000 2005 2010 2015 

CUC AAA AAU GUO GGA AUG CAA CAA UGC GUU UAU GUC ACA GAU GAA GAA 6095 
Leu Lys Asn Val Gly Met Gln Gln Cys Val Tyr Val Thr Asp Glu Glu 
2020 2025 2030 

GAA AUU UUC AGA UCA CUU AAC CUG AAC GCA GCU GUC GGA GCA UUG UAU 6143 
Glu Ile Phe Arg Ser Leu Asn Leu Asn Ala Ala Val Gly Ala Leu Tyr 
2035 2040 2045 

ACA GGA AAG AAG AAA AAU UAC UUU GAA AAU UUU UCA AGC GAA GAC AAA 6191 
Thr Gly Lys Lys Lys Asn Tyr Phe Glu Asn Phe Ser Ser Glu Asp Lys 
2050 2055 2060 

GAA GAA AUC GUG AUG AGA UCr UGCJ GAA CGU AUU UAC AAU GGS GAA CUU 6239 
Glu Glu Ile Val Met Arg Ser Cys Glu Arg Ile Tyr Asn Xaa Gln Leu 
2065 2070 2075 

GGC GUA UGG AAU GGA UCG CUC AAA GCU GAG AUC AGA CCA AUA GAG AAA 6287 
Gly Val Trp Asn Gly Ser Leu Lys Ala Glu Ile Arg Pro Ile Glu Lys 
2080 2085 2090 2095 

ACC AUG CJ3 AAU AAG ACU CGA ACC UUC ACA GCG GCC CCA UUA GAA ACU 6335 
Thr Met Leu Asn Lys Thr Arg Thr Phe Thr Ala Ala Pro Leu Glu Thr 
2100 2105 2110 

UUG CUC GGA GGA AAA GUG UGC GUG GAU GAU UUU AAU AAU GAA UUC UAU 6383 
Leu Leu Gly Gly Lys Val Cys Val Asp Asp Phe Asn Asn Gln Phe Tyr 
2115 2120 2125 

UCA CAU CAU UUA GAA GGU CCA UGG ACU GUU GGG AUA ACA AAA UUC UAU 6431 
Ser His His Leu Glu Gly Pro Trp Thr Val Gly Ile Thr Lys Phe Tyr 
2130 2135 2140 

GGA GGU UGG AAU CGC UUA CUG GAG AAG UUA CCA GAA GGA UGG GUU UAC 6479 
Gly Gly Trp Asn Arg Leu Leu Glu Lys Leu Pro Glu Gly Trp Val Tyr 
2145 2150 2155 

UGC GAU GCU GAC GGG UCU CAA UUU GAU AGU UCG UUA ACA CCA UAU CUC 6527 
Cys Asp Ala Asp Gly Ser Gln Phe Asp Ser Ser Leu Thr Pro Tyr Leu 
2160 2165 2170 2175 

AUC AAU GCA GUA UUA AAU AUU CGA UUG GAA UUU AUG GAA GAU UGG GAU 6575 
Ile Asn Ala Val Leu Asn Ile Arg Leu Gln Phe Met Glu Asp Trp Asp 
2180 2185 2190 

AUA GGA GCG CAA AUG OIA AAG AAC CUG UAC ACU GAG AUU GUU UAC ?^ 6623 
Ile Gly Ala Gln Met Leu Lys Asn Leu Tyr Thr Glu Ile Val Tyr Thr 
2195 2200 2205 

CCA AUC GCA ACG CCA GAC GGA UCA AUC GUG AAG UUC AAA GGU AAC 6671 
Pro Ile Ala Thr Pro Asp Gly Ser Ile Val Lys Lys Phe Lys Gly Asn 
2210 2215 2220 

AAU AGC GGA CAA CCU UCU ACA GUA GUG GAC AAC ACA UUG AUG GUU AUA 6719 
Asn Ser Gly Gln Pro Ser Thr Val Val Asp Asn Thr Leu Met Val Ile 
2225 2230 2235 

WO 97/02352 PCT/EIW02673 

AUAGCUUUCAACUAUGCTj^aiAIXilAA^ 6767 
Ile Ala Phe Asn Tyr Ala Met Leu Ser Ser Gly Ile Lys Glu Glu Glu 
2240 2245 2250 2255 

GAU AAU UGC O^U AGA AUG UUC GCG AAU GGU GAU 6815 
Ile Asp Asn Cys Cys Arg Met Phe Ala Asn Gly Asp Asp Leu Leu Leu 
2260 2265 2270 

GCAGUSGAUCCaGAULTJUGAGUDCADUUUAGAUGA^ 6863 
Ala Val His Pro Asp Phe Glu Phe Ile Leu Asp Glu Phe Gln Asn His 
2275 2280 2285 

UUUQGGAAUCUUGGGCTOAACUUCGAAUraACAUCAC^ 6911 
Phe Gly Asn Leu Gly Leu Asn Phe Glu Phe Thr Ser Arg Thr Arg Asp 
2290 2295 2300 

AAAOXGAACroOSGUUCAUGUCCACAAGAGGCAUCA^ 6959 
Lys Ser Glu Leu Trp Phe Met Ser Thr Arg Gly Ile Lys Tyr Glu Gly 
2305 2310 2315 

AUUaACAUACCAAAGOJUGAGAAAGAAAGAAlS^ 7007 
Ile Tyr Ile Pro Lys Leu Glu Lys Glu Arg Ile Val Ala Ile Leu Glu 
2320 2325 2330 2335 

IX3G <3VU CGA UC^i AAC UTO COT GAA CAU AGG LTX3 GAA GCU AUA UGU GCA 7055 
Trp Asp Arg Ser Asn Leu Pro Glu His Arg Leu Glu Ala Ile Cys Ala 
2340 2345 2350 

Gp AIX; GUU C»3 GCC UGG GGA UCC GAU aJC GOJ CAU GAA AUA CGA 7103 
Ala Met Val Glu Ala Trp Gly Tyr Ser Asp Leu Val His Glu Ile Arg 
2355 2360 2365 

AAGUTCUADGCGaSGOKJLTOGAAi^CAACOTUUUGCAAA^ 7151 
Lys Phe Tyr Ala Trp Leu Leu Glu Met Gln Pro Phe Ala Asn Leu Ala 
2370 2375 2380 

AAANAAGGGUTOGCCCCAUACADUGCCGAGACAGCACirC^ 7199 
Lys Xaa Gly Leu Ala Pro Tyr Ile Ala Glu Thr Ala Leu Arg Asn Leu 
2385 2390 2395 

UAU CUU GGA ACG GGU AUC AAA GAG GAA GAA AUU GAA AAA UAU CUU ^ 7247 
Tyr Leu Gly Thr Gly Ile Lys Glu Glu Glu Ile Glu Lys Tyr Leu Lys 
2400 2405 2410 2415 

CAA UUC AL^J AAG GAU CUU CCC GGA UAC AUA GAA GAU UAC AAU GAA GAU 7295 
Gln Phe Ile Lys Asp Leu Pro Gly Tyr Ile Glu Asp Tyr Asn Glu Asp 
2420 2425 2430 

GUA UUC CAU GAG UCG GGA ACU GUU GAU GCG GGU GCA CAA GGC GGC AGU 7343 
Val Phe His Gln Ser Gly Thr Val Asp Ala Gly Ala Gln Gly Gly Ser 
2435 2440 2445 



GGA AGC CAA GGG ACA ACA CCA CCA GCA ACA GGo AGU GGA GCA AAA CCA 7391 
Gly Ser Gln Gly Thr Thr Pro Pro Ala Thr Gly Ser Gly Ala Lys Pro 
2450 2455 2460 

GCC ACC UCA GGG GCA GGA UCU GGCJ AGU GAC ACA GGA GCU GGA ACU GGU 7439 
Ala Thr Ser Gly Ala Gly Ser Gly Ser Asp Thr Gly Ala Gly Thr Gly 
2465 2470 2475 

GUA ACU GGA AGU CAA GCA AGG ACU GGC AGU GGC ACU GGG ACG GGA UCU 7487 
Val Thr Gly Ser Gln Ala Arg Thr Gly Ser Gly Thr Gly Thr Gly Ser 
2480 2485 2490 2495 

GGA GCA ACC GGA GGC CAA UCA GGA UCU GGA AGU GGC ACU GAA CAG GUU 7535 
Gly Ala Thr Gly Gly Gln Ser Gly Ser Gly Ser Gly Thr Glu Gln Val 
2500 2505 2510 

AAC ACG GGU UCA GCA GGA ACU AAU GCA ACU GGA GGC CAA AGA GAU AGG 7583 
Asn Thr Gly Ser Ala Gly Thr Asn Ala Thr Gly Gly Gln Arg Asp Arg 
2515 2520 2525 

GAU GUG GAU GCA GGC UCA ACA GGA AAA AUU LrU GUA CCA AAG CUC AAG 7631 
Asp Val Asp Ala Gly Ser Thr Gly Lys Ile Ser Val Pro Lys Leu Lys 
2530 2535 2540 

GCC AUG UCA AAG AAA AUG CGC UUA CCU AAA GCA AAA GGA AAA GAU GUG 7679 
Ala Met Ser Lys Lys Met Arg Leu Pro Lys Ala Lys Gly Lys Asp Val 
2545 2550 2555 

CUA CAU UUG GAU UUU CUA UUG ACA UAC AAA CCA CAA CAA CAA GAC AUA 7727 
Leu His Leu Asp Phe Leu Leu Thr Tyr Lys Pro Gln Gln Gln Asp Ile 
2560 2565 2570 2575 

UCA AAC ACU AGA GCA ACC AAG GAA GAG UUU GAU AGA UGG UAU GAU GCC 7775 
Ser Asn Thr Arg Ala Thr Lys Glu Glu Phe Asp Arg Trp Tyr Asp Ala 
2580 2585 2590 

AUA AAG AAG GAA UAC GAA AUU GAU GAC ACA CAA AUG ACA GUU GUC AUG 7823 
Ile Lys Lys Glu Tyr Glu Ile Asp Asp Thr Gln Met Thr Val Val Met 
2595 2600 2605 

AGU GGC CUU AUG GUA UGG UGC AUC GAA AAU GGU UGC UCA CCA AAC AUA 7871 
Ser Gly Leu Met Val Trp Cys Ile Glu Asn Gly Cys Ser Pro Asn Ile 
2610 2615 2620 

AAC GGA AAU UGG ACA AUG AUG GAU AAA GAU GAA CAA AGG GUC UUC CCA 7919 
Asn Gly Asn Trp Thr Met Met Asp Lys Asp Glu Gln Arg Val Phe Pro 
2625 2630 2635 

CUC AAA CCG GUC AUU GAG AAU GCA LTTJ CCA ACU UUC CGA CAA AUU AUG 7967 
Leu Lys Pro Val Ile Glu Asn Ala Ser Pro Thr Phe Arg Gln Ile Met 
2640 2645 2650 2655 

CAU CAU UUC AGU GAU GCA GCU GAA GCG UAC AUA GAG UAC AGA AAC UCU 8015 
His His Phe Ser Asp Ala Ala Glu Ala Tyr Ile Glu Tyr Arg Asn Ser 
2660 2665 2670 

ACU GAG CGA UAU AUG CCA AGA UAC GGA CUU CGC AAU CUC ACC GAC 8063 
Thr Glu Arg Tyr Met Pro Arg Tyr Gly Leu Gln Arg Asn Leu Thr Asp 
2675 2680 2685 

UAU AGC UUA GCA CGG UAU GCA UUU GAU UUC UAU GAA AUG ACU UCA CGC 8111 
Tyr Ser Leu Ala Arg Tyr Ala Phe Asp Phe Tyr Glu Met Thr Ser Arg 
2690 2695 2700 

ACA CCU GCU AGA GCU AAA GAA GCC CAC AUG CAG AUG AAA GCC GCA GCA 8159 
Thr Pro Ala Arg Ala Lys Glu Ala His Met Gln Met Lys Ala Ala Ala 
2705 2710 2715 

GUU CGU GGU UCA AAC ACA CGA COS UUC GGU UUG GAU GGA AAU GUC GGC 8207 
Val Arg Gly Ser Asn Thr Arg Leu Phe Gly Leu Asp Gly Asn Val Gly 
2720 2725 2730 2735 

GAG ACU CAG GAG AAU ACA GAG AGA CAC ACA GCU GGC GAU GUU AGU CGC 8255 
Glu Thr Gln Glu Asn Thr Glu Arg His Thr Ala Gly Asp Val Ser Arg 
2740 2745 2750 

AAC AUG CAC UCU CUG UUG GGA GUG CAG CAA CAC CAC UAGUCUCCUG 8301 
Asn Met His Ser Leu Leu Gly Val Gln Gln His His 
2755 2760 

GAAACCCJGU UUGCAGUACC AAUAAUAUGU ACUAAUAUAU AGUAUUUUAG UGAGGUUUUA 8361 

CCUCGUCUUU ACUGUUUUAU UACGUAUGUA UUUAAAGCGU GAACCAGUCJ GCAACAUACA 8421 

GGGUUGGACC CAGUGTOUUC UGGUGOAGCG UGUACUAGCG L^GAGCCAUG AGAUGGACUG 8481 

CACJGGGUGU GGUUUU3CCA CUUGUGUUGC GAGUOrCJG GUAAGAGACA AAAAAAAAAA 8541 
AA 
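
The listing above pairs each RNA codon with a three-letter residue code. As a minimal sketch, assuming the standard genetic code applies, that pairing can be reproduced as follows. The function name `translate_rna` and both lookup tables are illustrative helpers, not part of the original disclosure; the example row is the one ending at nucleotide 5039.

```python
# Illustrative sketch only: translate a row of RNA codons into three-letter
# residue codes with the standard genetic code. All names are hypothetical.
from itertools import product

BASES = "UCAG"
# Standard genetic code, codons enumerated in UCAG order
# (first base outermost, third base innermost); "*" marks a stop codon.
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), ONE_LETTER)
}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "Ter",
}


def translate_rna(codons: str) -> str:
    """Translate whitespace-separated RNA codons; unreadable codons give Xaa."""
    residues = []
    for codon in codons.upper().split():
        one = CODON_TABLE.get(codon)  # None when the codon is not three of A/C/G/U
        residues.append(THREE_LETTER[one] if one else "Xaa")
    return " ".join(residues)


row = "GCA AAU GAA GCA GGG GUU GCA CAC GAG UCA AAA UCC AUG AUG AAC GGG"
print(translate_rna(row))
# Ala Asn Glu Ala Gly Val Ala His Glu Ser Lys Ser Met Met Asn Gly
```

Codons that cannot be read reliably are reported as Xaa rather than guessed, which mirrors how the listing marks undetermined residues.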



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2763 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Glu Glu Lys Gln Glu Tyr Leu Ala Lys Asp Gln Lys Leu Ser Arg 
1 5 10 15 

Met Ile Gln Phe Ile Lys Glu Arg Cys Asn Pro Lys Phe Ser His Leu 
20 25 30 

Pro Thr Leu Trp Gln Val Ala Glu Thr Ile Gly :s Tyr Thr Asp Asn 
35 40 45 



Gln Ser Lys Gln Ile Met Asp Val Ser Glu Ala Leu Ile Lys Val Asn 
50 55 60 

Thr Leu Thr Pro Asp Asp Ala Met Lys Ala Ser Ala Ala Leu Leu Glu 
65 70 75 80 

Val Ser Arg Trp Tyr Lys Asn Arg Lys Glu Ser Leu Lys Thr Asp Ser 
85 90 95 

Leu Glu Ser Phe Arg Asn Lys Ile Ser Pro Lys Ser Thr Ile Asn Ala 
100 105 110 

Ala Leu Met Cys Asp Asn Gln Leu Asp Lys Asn Ala Asn Phe Val Trp 
115 120 125 

Gly Asn Arg Glu Tyr His Ala Lys Arg Phe Phe Ala Asn Tyr Phe Xaa 
130 135 140 

Ala Val Asp Pro Thr Asp Ala Tyr Glu Lys His Val Thr Arg Phe Asn 
145 150 155 160 

Pro Asn Gly Gln Arg Lys Leu Ser Ile Gly Lys Leu Val Ile Pro Leu 
165 170 175 

Asp Phe Gln Lys Ile Arg Glu Ser Phe Val Gly Leu Ser Ile Asn Arg 
180 185 190 

Gln Pro Leu Asp Lys Cys Cys Val Ser Lys Ile Glu Gly Gly Tyr Ile 
195 200 205 

Tyr Pro Cys Cys Cys Val Thr Thr Glu Phe Gly Lys Pro Ala Tyr Ser 
210 215 220 

Glu Ile Ile Pro Pro Thr Lys Gly His Ile Thr Ile Gly Asn Ser Ile 
225 230 235 240 

Asp Ser Lys Ile Val Asp Leu Pro Asn Thr Thr Pro Pro Ser Met Tyr 
245 250 255 

Ile Ala Lys Asp Gly Tyr Cys Tyr Ile Asn Ile Phe Leu Ala Ala Met 
260 265 270 

Ile Asn Val Asn Glu Glu Ser Ala Lys Asp Tyr Thr Lys Phe Leu Arg 
275 280 285 

Asp Glu Leu Val Glu Arg Leu Gly Lys Trp Pro Lys Leu Lys Asp Val 
290 295 300 

Ala Thr Ala Cys Tyr Ala Leu Ser Val Met Phe Pro Glu Ile Lys Asn 
305 310 315 320 

Ala Glu Leu Pro Pro Ile Leu Val Asp His Glu Asn Lys Ser Met His 
325 330 335 

Val Ile Asp Ser Tyr Gly Ser Leu Ser Val Gly Phe His Ile Leu Lys 
340 345 350 

Ala Ser Thr Ile Gly Gln Leu Ile Lys Phe Gln Tyr Glu Ser Met Asp 
355 360 365 

Ser Glu Met Arg Glu Tyr Ile Val Gly Gly Thr Leu Thr Gln Gln Thr 
370 375 380 

Phe Asn Thr Leu Leu Lys Met Leu Thr Lys Asn Met Phe Lys Pro Glu 
385 390 395 400 

Arg Ile Lys Gln Ile Ile Glu Glu Glu Pro Phe Leu Leu Met Met Ala 
405 410 415 

Ile Ala Ser Pro Thr Val Leu Ile Ala Leu Tyr Asn Asn Cys Tyr Ile 
420 425 430 

Glu Gln Ala Met Thr Tyr Trp Ile Val Lys Asn Gln Gly Val Ala Ala 
435 440 445 

Ile Phe Ala Gln Leu Glu Ala Leu Ala Lys Lys Thr Ser Gln Ala Glu 
450 455 460 

Leu Leu Val Leu Gln Met Gln Ile Leu Glu Lys Ala Ser Asn Gln Leu 
465 470 475 480 

Arg Leu Ala Val Ser Gly Leu Ser His Ile Asp Pro Ala Lys Arg Leu 
485 490 495 

Leu Trp Ser His Leu Glu Ala Met Ser Thr Arg Ser Glu Met Asn Lys 
500 505 510 

Glu Leu Ile Ala Glu Gly Tyr Ala Leu Tyr Asp Glu Arg Leu Tyr Thr 
515 520 525 

Leu Met Glu Lys Ser Tyr Val Asp Gln Leu Asn Gln Ser Trp Ala Glu 
530 535 540 

Leu Ser Tyr Cys Gly Lys Phe Ser Ala Ile ItD Arg Val Phe Arg Val 
545 550 555 560 

Arg Lys Tyr Tyr Lys Pro Ser Leu Thr Val Arg Lys Ser Val Asp Leu 
565 570 575 

Gly Ala Val Tyr Asn Ile Ser Ala Thr His Leu Ile Ser Asp Leu Ala 
580 585 590 

Arg Lys Ser Gln Asp Gln Val Ser Ser Ile Leu Thr Lys Leu Arg Asn 
595 600 605 

Gly Phe Tyr Asp Lys Leu Glu Lys Val Arg Ile Arg Thr Ile Lys Thr 
610 615 620 

Val Tyr Trp Phe Ile Pro Asp Ile Phe Arg Leu Val His Ile Phe Ile 
625 630 635 640 

Val Leu Ser Leu Leu Thr Thr Ile Ala Asn Thr Ile Ile Val Thr Met 
645 650 655 

Asn Asp Tyr Lys Lys Leu Lys Lys Gln Gln Arg Glu Asp Glu Tyr Glu 
660 665 670 

Ala Glu Ile Ser Glu Val Arg Arg Ile His Ser Thr Leu Met Glu Glu 
675 680 685 

Arg Lys Asp Asn Leu Thr Cys Glu Gln Stoe Ile Glu Tyr Met Arg Xaa 
690 695 700 

Asn His Pro Arg Leu Val Gly Xaa Thr Leu Asp Leu Thr His Thr Gly 
705 710 715 720 

Val Ile His Glu Gly Lys Ser Asn Leu Glu Thr Asn Leu Glu Gln Ser 
725 730 735 

Met Ala Val Gly Thr Leu Ile Thr Met Ile Leu Asp Pro Gln Lys Ser 
740 745 750 

Asp Ala Val Tyr Lys Val Leu Asn Lys Met Arg Thr Val Ile Ser Thr 
755 760 765 

Ile Glu Gln Asn Val Pro Phe Pro Ser Val Asn Phe Ser Asn Ile Leu 
770 775 780 

Thr Pro Pro Val Ala Gln Gln Ser Val Asp Val Asp Glu Pro Leu Thr 
785 790 795 800 

Leu Ser Thr Asp Lys Asn Leu Thr Ile Asp Phe Asp Thr Asn Gln Asp 
805 810 815 

Leu Pro Ala Asp Thr Phe Ser Asn Asp Val Thr Phe Xaa Asp Trp Trp 
820 825 830 

Ser Xaa Gln Ser Asn Asn Arg Thr Val Xaa His Tyr Arg Xaa Trp 
835 840 845 

Gly Glu Ser Xaa Leu Glu Phe Thr Arg Glu Asn Ala Ala His Thr Ser 
850 855 860 

Ile Glu Leu Ala His Ser Asn Ile Glu Arg Glu Phe Leu Leu Arg Gly 
865 870 875 880 

Ala Val Gly Ser Gly Lys Ser Thr Gly Leu Pro Tyr His Leu Ser Met 
885 890 895 

Arg Gly Lys Val Leu Leu Leu Glu Pro Thr Arg Pro Leu Ala Glu Asn 
900 905 910 

Val Cys Arg Gln Leu Gln Gly Pro Pro Phe Asn Val Ser Pro Thr Leu 
915 920 925 

Gln Met Arg Gly Leu Ser Ser Phe Gly Cys Thr Pro Ile Thr Ile Met 
930 935 940 

Thr Ser Gly Phe Ala Leu His Met Tyr Ala Asn Asn Pro Asp Lys Ile 
945 950 955 960 

Ser Glu Tyr Asp Phe Ile Ile Phe Asp Glu Cys His Ile Met Glu Ala 
965 970 975 

Pro Ala Met Ala Phe Tyr Cys Leu Leu Lys Glu Tyr Glu Tyr Arg Gly 
980 985 990 

Lys Ile Ile Lys Val Ser Ala Thr Pro Pro Gly Arg Glu Cys Glu Phe 
995 1000 1005 

Thr Thr Gln His Pro Val Asp Ile His Val Cys Glu Asn Leu Thr Gln 
1010 1015 1020 

Gln Gln Phe Val Met Glu Leu Gly Thr Gly Ser Thr Ala Asp Ala Thr 
1025 1030 1035 1040 

Lys Tyr Gly Asn Asn Ile Leu Val Tyr Val Ala Ser Tyr Asn Asp Val 
1045 1050 1055 

Asp Ser Leu Ser Gln Ala Leu Val Glu Leu Lys Phe Ser Val Ile Lys 
1060 1065 1070 

Val Asp Gly Arg Thr Met Lys Gln Asn Thr Thr Gly Ile Ile Thr Asn 
1075 1080 1085 

Gly Thr Ala Gln Lys Lys Cys Phe Val Val Ala Thr Asn Ile Ile Glu 
1090 1095 1100 

Asn Gly Val Thr Leu Asp Ile Asp Val Gly Val Asp Phe Gly Leu Lys 
1105 1110 1115 1120 

Val Ser Ala Asp Leu Asp Val Asp Asn Arg Ala Val Leu Tyr Lys Arg 
1125 1130 1135 

Val Ser Ile Ser Tyr Gly Glu Leu Ile Gln Arg Leu Gly Arg Val Gly 
1140 1145 1150 

Arg Asn Lys Pro Gly Thr Val Ile Arg Ile Gly Lys Thr Met Lys Gly 
1155 1160 1165 

Leu Gln Glu Ile Pro Ala Met Ile Ala Thr Glu Ala Ala Phe Met Cys 
1170 1175 1180 

Phe Ala Tyr Gly Leu Lys Val Ile Thr His Asn Val Ser Thr Thr His 
1185 1190 1195 1200 

Leu Ala Lys Cys Thr Val Lys Gln Ala Arg Thr Met Met Gln Phe Glu 
1205 1210 1215 

Leu Ser Pro Phe Val Met Ala Glu Leu Val Lys Phe Asp Gly Ser Met 
1220 1225 1230 

His Pro Gln Ile His Glu Ala Leu Val Lys Tyr Lys Leu Arg Asp Ser 
1235 1240 1245 

Val Ile Met Leu Arg Pro Asn Ala Leu Pro Arg Val Asn Leu His Asn 
1250 1255 1260 

Trp Leu Uar Ala Arg Asp Tyr Asn Arg Ile Gly Cys Ser Leu Glu Leu 
1265 1270 1275 1280 

Glu Asp His Val Lys Ile Pro Tyr Tyr Ile Arg Gly Val Pro Asp Lys 
1285 1290 1295 

Leu Tyr Gly Lys Leu Tyr Asp Ile Ile Leu Gln Asp Ser Pro Thr Ser 
1300 1305 1310 

Cys Tyr Ser Arg Leu Ser Ser Ala Cys Ala Gly Lys Val Ala Tyr 
1315 1320 1325 

Leu Arg Thr Asp Pro Phe Ser Leu Pro Arg Thr Ile Ala Ile Ile Asn 
1330 1335 1340 

Ala Xaa Ile Thr Glu Glu Tyr Ala Lys Arg Asp His Tyr Arg Asn Met 
1345 1350 1355 1360 

Ile Xaa Asn Pro Ser Ser Ser His Ala Phe Ser Leu Asn Gly Leu Val 
1365 1370 1375 

Ser Met Ile Ala Thr Arg Tyr Met Lys Asp His Thr Lys Glu Asn Ile 
1380 1385 1390 

Asp Lys Leu Ile Arg Val Arg Asp Gln Leu Leu Glu Phe Gln Gly Thr 
1395 1400 1405 

Gly Met Gln Phe Gln Asp Pro Ser Glu Leu Met Glu Ile Gly Ala Leu 
1410 1415 1420 

Asn Thr Val Ile His Gln Gly Met Asp Ala Ile Ala Ala Cys Ile Glu 
1425 1430 1435 1440 

Leu Gln Gly Arg Trp Asn Ala Ser Leu Ile Gln Arg Asp Leu Leu Ile 
1445 1450 1455 

Ala Gly Gly Val Phe Ile Gly Gly Ile Leu Met Met Trp Ser Leu Phe 
1460 1465 1470 

Thr Lys Trp Ser Asn Thr Asn Val Ser His Gln Gly Lys Asn Lys Arg 
1475 1480 1485 

Ser Arg Gln Lys Leu Arg Phe Lys Glu Ala Arg Asp Asn Lys Tyr Ala 
1490 1495 1500 

Tyr Asp Val Thr Gly Ser Glu Glu Cys Leu Gly Glu Asn Phe Gly Thr 
1505 1510 1515 1520 

Ala Tyr Thr Lys Lys Gly Lys Gly Lys Gly Thr Lys Val Gly Leu Gly 
1525 1530 1535 

Val Lys Gln His Lys Phe His Met Met Tyr Gly Phe Asp Pro Gln Glu 
1540 1545 1550 

Tyr Asn Leu Ile Arg Phe Val Asp Pro Leu Thr Gly Ala Thr Leu Asp 
1555 1560 1565 

^rT'^^ Ala Asp Ile Arg Leu Ile Gln Glu His Phe Ala Glu 
1570 1575 1580 

Ile Arg Glu Glu Ala Val Ile Asn Asp Thr Ile Glu Arg Gln Gln Ile 
1585 1590 1595 1600 

Tyr Gly Asn Pro Gly Leu Gln Ala Phe Phe Ile Gln Asn Gly Ser Ala 
1605 1610 1615 

Asn Ala Leu Arg Val Asp Leu Thr Pro His Ser Pro Thr Arg Val Val 
1620 1625 1630 

Thr Gly Asn Asn Ile Ala Gly Phe Pro Glu Tyr Glu Gly Thr Leu Arg 
1635 1640 1645 

Gln Thr Gly Thr Ala Ile Thr Ile Pro Ile Gly Gln Val Pro Ile Ala 
1650 1655 1660 

Asn Glu Ala Gly Val Ala His Glu Ser Lys Ser Met Met Asn Gly Leu 
1665 1670 1675 1680 

Gly Asp Tyr Thr Pro Ile Ser Gln Gln Leu Cys Leu Val Gln Asn Asp 
1685 1690 1695 

Ser Asp Gly Val Lys Arg Asn Val Phe Ser Ile Gly Tyr Gly Ser Tyr 
1700 1705 1710 

Leu Ile Ser Pro Ala His Leu Phe Lys Tyr Asn Asn Gly Glu Ile Thr 
1715 1720 1725 

Ile Arg Ser Ser Arg Gly Leu Tyr Lys Ile Arg Asn Ser Val Asp Leu 
1730 1735 1740 

Lys Leu His Pro Ile Ala His Arg Asp Met Val Ile Ile Gln Leu Pro 
1745 1750 1755 1760 

Lys Asp Phe Pro Pro Phe Pro Met Arg Leu Lys Phe Glu Gln Pro Ser 
1765 1770 1775 

Arg Asp Met Arg Val Cys Leu Val Gly Val Asn Phe Gln Gln Asn Tyr 
1780 1785 1790 

Ser Thr Cys Ile Val Ser Glu Ser Ser Val Thr Ala Pro Lys Gly Asn 
1795 1800 1805 

Gly Asp Phe Trp Lys His Trp Ile Ser Thr Val Asp Gly Gln Cys Gly 
1810 1815 1820 

Leu Pro Leu Val Asp Thr Lys Ser Lys His Ile Val Gly Ile His Ser 
1825 1830 1835 1840 

Leu Ala Ser Thr Ser Gly Asn Thr Asn Phe Phe Val Ala Val Pro Glu 
1845 1850 1855 

Asn Phe Asn Glu Tyr Ile Asn Gly Leu Val Gln Ala Asn Lys Trp Glu 
1860 1865 1870 

Lys Gly Trp His Tyr Asn Pro Asn Leu Ile Ser Trp Cys Gly Leu Asn 
1875 1880 1885 

Leu Val Asp Ser Ala Pro Lys Gly Leu Phe Lys Thr Ser Lys Leu Val 
1890 1895 1900 

Glu Asp Leu Asp Ala Ser Val Glu Glu Gln Cys Lys Ile Thr Glu Thr 
1905 1910 1915 1920 

Trp Leu Thr Glu Gln Leu Gln Asp Asn Leu Gln Val Val Ala Lys Cys 
1925 1930 1935 

Pro Gly Gln Leu Val Thr Lys His Val Val Lys Gly Gln Cys Pro His 
1940 1945 1950 

Phe Gln Leu Tyr Leu Ser Thr His Asp Asp Ala Lys Glu Tyr Phe Ala 
1955 1960 1965 

Pro Met Leu Gly Lys Tyr Asp Lys Ser Arg Leu Asn Arg Ala Ala Phe 
1970 1975 1980 

Ile Lys Asp Ile Ser Lys Tyr Ala Lys Pro Ile Tyr Ile Gly Glu Ile 
1985 1990 1995 2000 

Glu Tyr Asp Ile Phe Asp Arg Ala Val Gln Arg Val Val Asn Ile Leu 
2005 2010 2015 

Lys Asn Val Gly Met Gln Gln Cys Val Tyr Val Thr Asp Glu Glu Glu 
2020 2025 2030 

Ile Phe Arg Ser Leu Asn Leu Asn Ala Ala Val Gly Ala Leu Tyr Thr 
2035 2040 2045 

Gly Lys Lys Lys Asn Tyr Phe Glu Asn Phe Ser Ser Glu Asp Lys Glu 
2050 2055 2060 

Glu Ile Val Met Arg Ser Cys Glu Arg Ile Tyr Asn Xaa Gln Leu Gly 
2065 2070 2075 2080 

Val Trp Asn Gly Ser Leu Lys Ala Glu Ile Arg Pro Ile Glu Lys Thr 
2085 2090 2095 

Met Leu Asn Lys Thr Arg Thr Phe Thr Ala Ala Pro Leu Glu Thr Leu 
2100 2105 2110 

Leu Gly Gly Lys Val Cys Val Asp Asp Phe Asn Asn Gln Phe Tyr Ser 
2115 2120 2125 

His His Leu Glu Gly Pro Trp Thr Val Gly Ile Thr Lys Phe Tyr Gly 
2130 2135 2140 

Gly Trp Asn Arg Leu Leu Glu Lys Leu Pro Glu Gly Trp Val Tyr Cys 
2145 2150 2155 2160 

Asp Ala Asp Gly Ser Gln Phe Asp Ser Ser Leu Thr Pro Tyr Leu Ile 
2165 2170 2175 

Asn Ala Val Leu Asn Ile Arg Leu Gln Phe Met Glu Asp Trp Asp Ile 
2180 2185 2190 

Gly Ala Gln Met Leu Lys Asn Leu Tyr Thr Glu Ile Val Tyr Thr Pro 
2195 2200 2205 

Ile Ala Thr Pro Asp Gly Ser Ile Val Lys Lys Phe Lys Gly Asn Asn 
2210 2215 2220 

Ser Gly Gln Pro Ser Thr Val Val Asp Asn Thr Leu Met Val Ile Ile 
2225 2230 2235 2240 

Ala Phe Asn Tyr Ala Met Leu Ser Ser Gly Ile Lys Glu Glu Glu Ile 
2245 2250 2255 

Asp Asn Cys Cys Arg Met Phe Ala Asn Gly Asp Asp Leu Leu Leu Ala 
2260 2265 2270 

Val His Pro Asp Phe Glu Phe Ile Leu Asp Glu Phe Gln Asn His Phe 
2275 2280 2285 

Gly Asn Leu Gly Leu Asn Phe Glu Phe Thr Ser Arg Thr Arg Asp Lys 
2290 2295 2300 

Ser Glu Leu Trp Phe Met Ser Thr Arg Gly Ile Lys Tyr Glu Gly Ile 
2305 2310 2315 2320 

Tyr Ile Pro Lys Leu Glu Lys Glu Arg Ile Val Ala Ile Leu Glu Trp 
2325 2330 2335 

Asp Arg Ser Asn Leu Pro Glu His Arg Leu Glu Ala Ile Cys Ala Ala 
2340 2345 2350 

Met Val Glu Ala Trp Gly Tyr Ser Asp Leu Val His Glu Ile Arg Lys 
2355 2360 2365 

Phe Tyr Ala Trp Leu Leu Glu Met Gln Pro Phe Ala Asn Leu Ala Lys 
2370 2375 2380 

Xaa Gly Leu Ala Pro Tyr Ile Ala Glu Thr Ala Leu Arg Asn Leu Tyr 
2385 2390 2395 2400 

Leu Gly Thr Gly Ile Lys Glu Glu Glu Ile Glu Lys Tyr Leu Lys Gln 
2405 2410 2415 

Phe Ile Lys Asp Leu Pro Gly Tyr Ile Glu Asp Tyr Asn Glu Asp Val 
2420 2425 2430 

Phe His Gln Ser Gly Thr Val Asp Ala Gly Ala Gln Gly Gly Ser Gly 
2435 2440 2445 

Ser Gln Gly Thr Thr Pro Pro Ala Thr Gly Ser Gly Ala Lys Pro Ala 
2450 2455 2460 

Thr Ser Gly Ala Gly Ser Gly Ser Asp Thr Gly Ala Gly Thr Gly Val 
2465 2470 2475 2480 

Thr Gly Ser Gln Ala Arg Thr Gly Ser Gly Thr Gly Thr Gly Ser Gly 
2485 2490 2495 

Ala Thr Gly Gly Gln Ser Gly Ser Gly Ser Gly Thr Glu Gln Val Asn 
2500 2505 2510 

Thr Gly Ser Ala Gly Thr Asn Ala Thr Gly Gly Gln Arg Asp Arg Asp 
2515 2520 2525 

Val Asp Ala Gly Ser Thr Gly Lys Ile Ser Val Pro Lys Leu Lys Ala 
2530 2535 2540 

Met Ser Lys Lys Met Arg Leu Pro Lys Ala Lys Gly Lys Asp Val Leu 
2545 2550 2555 2560 

His Leu Asp Phe Leu Leu Thr Tyr Lys Pro Gln Gln Gln Asp Ile Ser 
2565 2570 2575 

Asn Thr Arg Ala Thr Lys Glu Glu Phe Asp Arg Trp Tyr Asp Ala Ile 
2580 2585 2590 

Lys Lys Glu Tyr Glu Ile Asp Asp Thr Gln Met Thr Val Val Met Ser 
2595 2600 2605 

Gly Leu Met Val Trp Cys Ile Glu Asn Gly Cys Ser Pro Asn Ile Asn 
2610 2615 2620 

Gly Asn Trp Thr Met Met Asp Lys Asp Glu Gln Arg Val Phe Pro Leu 
2625 2630 2635 2640 

Lys Pro Val Ile Glu Asn Ala Ser Pro Thr Phe Arg Gln Ile Met His 
2645 2650 2655 

His Phe Ser Asp Ala Ala Glu Ala Tyr Ile Glu Tyr Arg Asn Ser Thr 
2660 2665 2670 

Glu Arg Tyr Met Pro Arg Tyr Gly Leu Gln Arg Asn Leu Thr Asp Tyr 
2675 2680 2685 

Ser Leu Ala Arg Tyr Ala Phe Asp Phe Tyr Glu Met Thr Ser Arg Thr 
2690 2695 2700 

Pro Ala Arg Ala Lys Glu Ala His Met Gln Met Lys Ala Ala Ala Val 
2705 2710 2715 2720 

Arg Gly Ser Asn Thr Arg Leu Phe Gly Leu Asp Gly Asn Val Gly Glu 
2725 2730 2735 

Thr Gln Glu Asn Thr Glu Arg His Thr Ala Gly Asp Val Ser Arg Asn 
2740 2745 2750 

Met His Ser Leu Leu Gly Val Gln Gln His His 
2755 2760 
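
Claim 20 of this document refers to sub-regions of the SEQ ID NO:2 polyprotein by residue position. The following is a minimal sketch of how such 1-based, inclusive ranges map onto a sequence string. The ranges are the ones recited in claim 20; the helper name `extract_region` and the placeholder sequence are assumptions made for illustration, and no real residues are reproduced here.

```python
# Illustrative sketch only: slice a polyprotein by 1-based, inclusive
# residue positions. The placeholder string stands in for SEQ ID NO:2.
REGIONS = {
    "P3 proteinase": (378, 791),
    "helicase": (792, 1430),
    "NIa proteinase": (1484, 1914),
    "RdRp": (1915, 2435),
}


def extract_region(polyprotein: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of the polyprotein."""
    if not 1 <= start <= end <= len(polyprotein):
        raise ValueError(f"range {start}-{end} is outside the sequence")
    return polyprotein[start - 1:end]


placeholder = "X" * 2763  # hypothetical stand-in with the stated length of 2763

for name, (start, end) in REGIONS.items():
    fragment = extract_region(placeholder, start, end)
    print(f"{name}: residues {start}-{end}, {len(fragment)} aa")
```

Under these ranges the four regions are 414, 639, 431 and 521 residues long, and none of them overlap.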

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "first Adh internal control primer" 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
TGCATGTCGG TTGICTTGCA 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "second Adh internal control primer" 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CTCAGCAACT ACCTAGACCA 20 
(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "first synthetic PAT gene primer" 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
TGrrCTCCGGA GAGGAGACC 19 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "second synthetic PAT gene primer" 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
CCAACATCAT GCCATCCACC 20 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "first NIa proteinase gene primer" 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
GCGQGATCCA TGQGC3AAGAA CAAftCQCftGr TGA 
(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "second NIa proteinase primer" 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
GCGGAGCTCT TACICTTCAA aSCTOGQSIC 30 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 10, line 34 

B. IDENTIFICATION OF DEPOSIT 

Further deposits are identified on an additional sheet [ ] 

Name of depositary institution: Agricultural Research Service Culture Collection (NRRL) 

Address of depositary institution (including postal code and country): 
1815 North University Street 
Peoria, IL 61604 
USA 

Date of deposit: 29 June 1995 (29.06.95) 

Accession Number: NRRL B-21479 

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [ ] 

We request the Expert Solution where available 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 

For receiving Office use only 

This sheet was received with the international application 

Authorized officer 

R.LH. Pather 

For International Bureau use only 

This sheet was received by the International Bureau on: 

Authorized officer 

Form PCT/RO/134 (July 1992) 

What is claimed is: 

1. A chimeric gene comprising a monocotyledonous plant promoter operably linked to a 
nucleotide sequence derived from the genomic sequence of a virus infecting 
monocotyledonous plants, wherein said nucleotide sequence contains a modification 
rendering a messenger RNA transcribed from said nucleotide sequence incapable of 
complete translation. 

2. The chimeric gene of claim 1 wherein said virus is selected from the group consisting of 
a potyvirus, a luteovirus, a tenuivirus, a carmovirus, a machlovirus, a geminivirus and a 
reovirus. 

3. The chimeric gene of claim 2 wherein said virus is a potyvirus. 

4. A chimeric gene comprising a monocotyledonous plant promoter operably linked to a 
nucleotide sequence derived from the genomic sequence of a maize dwarf mosaic 
virus, wherein said nucleotide sequence contains a modification rendering a messenger 
RNA transcribed from said nucleotide sequence incapable of complete translation. 

5. The chimeric gene of claim 4 wherein said virus is maize dwarf mosaic virus strain B. 

6. The chimeric gene of claim 4 wherein said transcribed RNA is capable of translating an 
attenuated peptide of a maize dwarf mosaic virus protein. 

7. The chimeric gene of claim 6 wherein said attenuated peptide is less than 20 amino 
acids in length. 

8. The chimeric gene of claim 4 wherein said transcribed RNA cannot be translated. 

9. The chimeric gene of claim 4 wherein said transcribed RNA sequence does not include 
the translation initiation codon of said maize dwarf mosaic virus, strain B. 

10. The chimeric gene of claim 4 wherein said transcribed RNA sequence encodes a 
portion of a viral protein selected from the group consisting of a coat protein, a 
proteinase, a replicase, a helicase, a Vpg protein, a 6K protein and a helper 
component. 

11. The chimeric gene of claim 4 wherein said modification comprises addition of a 
premature stop codon into said transcribed RNA. 

12. The chimeric gene of claim 4 wherein expression of said gene in transgenic maize, 
sorghum or sugarcane inhibits infection of said transgenic plants by maize dwarf 
mosaic virus. 

13. The chimeric gene of claim 12 wherein expression of said gene in transgenic maize 
inhibits infection of the transgenic plants by maize dwarf mosaic virus. 

14. The chimeric gene of claim 5 wherein said transcribed RNA comprises nucleotides 
4452 to 5744 of SEQ ID No. 1 and said modification comprises the substitution of a T 
for the A at position 4470 of SEQ ID No. 1. 

15. The chimeric gene of claim 14 wherein said modification further comprises the insertion 
of an ATG codon immediately before the G at position 4452 of SEQ ID No. 1. 

16. The chimeric gene of claim 4 wherein said monocotyledonous plant promoter is 
selected from the group consisting of a maize ubiquitin promoter, a maize actin 
promoter and a maize phosphoenolpyruvate carboxylase promoter. 

17. A method for producing a monocotyledonous plant with an inheritable trait of resistance 
to infection by a maize dwarf mosaic virus comprising transforming said plant with a 
chimeric gene according to claim 4. 

18. A monocotyledonous plant having an inheritable trait of resistance to infection by a 
maize dwarf mosaic virus, wherein said plant comprises a chimeric gene according to 
claim 4. 

19. A chimeric gene comprising a plant promoter operably linked to a nucleotide sequence 
derived from the genomic sequence of maize dwarf mosaic virus strain B encoding a 
viral protein other than a coat protein, wherein transgenic expression of said chimeric 
gene in a plant inhibits infection of said plant with said virus. 

20. The chimeric gene according to claim 19 wherein said viral protein is selected from the 
group consisting of RNA dependent RNA polymerase (RdRp) having the amino acid 
sequence from position 1915 to 2435 of SEQ ID No. 2, NIa proteinase having the 
amino acid sequence from position 1484 to 1914 of SEQ ID No. 2, helicase having the 
amino acid sequence from position 792 to 1430 of SEQ ID No. 2, and P3 proteinase 
having the amino acid sequence from position 378 to 791 of SEQ ID No. 2. 

21. The chimeric gene of claim 20 wherein said viral protein is a replicase. 

22. The chimeric gene of claim 20 wherein said plant promoter is selected from the group 
consisting of a plant ubiquitin gene promoter, a plant actin gene promoter, and a plant 
pith-preferred promoter. 

23. A method for producing a plant with an inheritable trait of resistance to infection by 
maize dwarf mosaic virus strain B comprising transforming said plant with the chimeric 
gene of claim 19. 

24. A plant comprising the chimeric gene of claim 22. 

25. A method for protecting progeny of a monocotyledonous parent plant from viral 
infection comprising transforming said parent plant with a chimeric gene according to 
claim 1 and obtaining progeny plants or breeding said parent plant with a plant 
according to claim 18. 

26. A method according to claim 25, wherein said progeny are protected from infection with 
maize dwarf mosaic virus. 

27. A method according to claim 25, wherein the progeny of maize, sorghum or sugarcane 
plants are protected from viral infection. 
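
Claims 11, 14 and 15 describe two edits to a viral fragment: a single base substitution that creates a premature stop codon, and an ATG inserted immediately before the first base. The sketch below shows the effect of both edits on a made-up 30-nucleotide DNA fragment, which is not the maize dwarf mosaic virus sequence. The coordinates 4452 and 4470 are taken from claim 14; the function names, the toy fragment and the use of the standard genetic code are assumptions made for illustration.

```python
# Illustrative sketch only: insert an ATG before a fragment and introduce a
# premature stop codon by a single A-to-T substitution. Toy data throughout.
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; "*" marks a stop codon.
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), ONE_LETTER)
}


def substitute(fragment: str, fragment_start: int, position: int, base: str) -> str:
    """Replace the base at a 1-based genomic position inside the fragment."""
    index = position - fragment_start
    if not 0 <= index < len(fragment):
        raise ValueError("position lies outside the fragment")
    return fragment[:index] + base + fragment[index + 1:]


def translate(dna: str) -> str:
    """Translate from the first base until a stop codon or the end."""
    peptide = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        peptide.append(residue)
    return "".join(peptide)


FRAGMENT_START = 4452  # first nucleotide of the fragment, as recited in claim 14
toy_fragment = "GGAAAGAAACCAGGTACTAAAGGTCTTGGA"  # hypothetical stand-in sequence

# In this toy fragment, position 4470 is the first base (A) of an AAA codon.
assert toy_fragment[4470 - FRAGMENT_START] == "A"

with_atg = "ATG" + toy_fragment
mutated = "ATG" + substitute(toy_fragment, FRAGMENT_START, 4470, "T")

print(translate(with_atg))  # MGKKPGTKGLG
print(translate(mutated))   # MGKKPGT
```

Because 4470 lies 18 bases after 4452, it falls on the first base of a codon in the frame set by the inserted ATG, so in this toy example the substitution turns AAA into the stop codon TAA and translation ends after a short peptide.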